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METHOD FOR DETERMINING OVERLAY ERROR BY COMPARISON BETWEEN 
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TARGETS, by Walter D. Mieher et al., filed 5 December 2002, (2) Application No. 
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ACQUIRED FROM MULTIPLE OVERLAY MEASUREMENT PATTERNS, by Walter 
D. Mieher, filed 22 February 2003, and (5) Application No. 60/498,524, filed 27 August 
2003, entitled "METHOD AND APPARATUS COMBINING IMAGING AND 
SCATTEROMETRY FOR OVERLAY METROLOGY", by Mike Adel. These applications 
5 are herein incorporated by reference in their entirety. 

BACKGROUND OF THE INVENTION 

10 The present invention relates to determination of overlay between structures formed 

in single or multiple layers. More particularly, it relates to determining overlay based on 
diffraction of radiation interacting with such structures. 

In various manufacturing and production environments, there is a need to control 
alignment between various layers of samples, or within particular layers of such samples. 
15 For example, in the semiconductor manufacturing industry, electronic devices may be 
produced by fabricating a series of layers on a substrate, some or all of the layers including 
various structures. The relative position of such structures both within particular layers and 
with respect to structures in other layers is relevant or even critical to the performance of 
completed electronic devices. 

20 The relative position of structures within such a sample is sometimes called overlay. 

Various technology and processes for measuring overlay have been developed and employed 
with varying degrees of success. More recently, various efforts have been focused on 
utilizing radiation scatterometry as a basis for overlay metrology. 
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Certain existing approaches to determining overlay from scatterometry 
measurements concentrate on comparison of the measured spectra to calculated theoretical 
spectra based on model shape profiles, overlay, and film stack, and material optical 
properties (n,k dispersion curves), or comparison to a reference signal from a calibration 
5 wafer. 

Existing approaches have several associated disadvantages. For example, a relatively 
large number of parameters must be included in the profile, overlay, and film modeling to 
accurately determine the overlay. For example, in some approaches using simple trapezoidal 
models for both the upper and lower layer profiles, the minimum number of pattern 

10 parameters that must be included is seven, including overlay. If film thicknesses variation is 
included in the model, the number of parameters increases correspondingly. A large number 
of parameters could require increased processing resources, may introduce corresponding 
errors, and may delay the results, thereby possibly decreasing throughput and increasing 
inefficiencies and costs. For example, comparison of a measured spectrum to calculated 

15 reference spectra takes longer with more parameters, whether a library-based approach is 
used or a regression approach is used. 

Another disadvantage of certain existing approaches to determination of overlay 
based on scatterometry is the detailed knowledge of the film stack, film materials, and 
pattern element profiles that may be required to determine accurate theoretical spectra to 
20 compare to the measured spectra. 

Yet another disadvantage of certain existing approaches to determination of overlay 
based on scatterometry is the accurate knowledge of the scatterometry optical system that 
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may be required to determine accurate theoretical spectra to compare to the measured 
spectra. 

Therefore, in light of the deficiencies of existing approaches to determination of 
overlay based on scatterometry, there is a need for improved systems and methods for 
determination of overlay based on scatterometry. 
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SUMMARY OF THE INVENTION 

Accordingly, mechanisms are provided for determining overlay error between two 
layers of a sample. In one embodiment, a method for determining overlay between a 
plurality of first structures in a first layer of a sample and a plurality of second structures in a 
second layer of the sample is disclosed. Targets A, B, C and D that each include a portion of 
the first and second structures are provided. Target A is designed to have an offset Xa 
between its first and second structures portions; target B is designed to have an offset Xb 
between its first and second structures portions; target C is designed to have an offset Xc 
between its first and second structures portions; and target D is designed to have an offset 
Xd between its first and second structures portions. Each of the offsets Xa, Xb, Xc and Xd 
is preferably different from zero; Xa is an opposite sign and differs from Xb; and Xc is an 
opposite sign and differs from Xd. The targets A, B, C and D are illuminated with 
electromagnetic radiation to obtain spectra Sa, Sb, Sc, and So from targets A, B, C, and D, 
respectively. Any overlay error between the first structures and the second structures is then 
determined using a linear approximation based on the obtained spectra Sa, Sb, Sc, and So- 

In general, an error offset E may be determined by analyzing at least the measured 
spectra from four or more targets A, B, C, and D each having offsets between two patterned 
layers, such as offsets Xa through Xd. This analysis can be performed without comparing 
any of the spectra to a known or reference spectra. In other words, the error determination 
techniques of the present invention do not require a calibration operation. 

In a specific implementation, determining any overlay error is accomplished by (i) 
determining a difference spectrum Dl from the spectra Sa and Sb, (h) determining a 
difference spectrum D2 from the spectra Sc and S D , (hi) and determining any overlay error 
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by performing a linear approximation based on the difference spectra Dl and D2. In a 
further aspect, the linear approximation is based on a property PI of the difference spectrum 
Dl and a property P2 of the difference spectrum D2. 

In one target implementation, each of the targets A, B, C, and D comprises a grating 
structure Gal having periodic structures with a period Tal disposed at least partially within 
the first layer and a grating structure Ga2 having periodic structures with a period Ta2 
disposed at least partially within the second layer. The first period Tal and the second 
period Ta2 are substantially identical, and the offsets Xa, Xb, Xc, and Xd are each produced 
by offsetting the structures with the period Tal of the grating structure Gal with respect to 
the structures with the period Ta2 of the grating structure Ga2 by the sum of a first distance 
F and a second distance fO, wherein the second distance fO has a smaller absolute value than 
the first distance F. 

In another aspect, the invention pertains to an optical system operable to determine 
overlay error between two layers of a sample. The system generally includes one or more 
processors configured to perform one or more of the described method operations. 

These and other features and advantages of the present invention will be presented in 
more detail in the following specification of the invention and the accompanying figures 
which illustrate by way of example the principles of the invention. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 illustrates the relative distribution of designed overlay offsets Xa, Xb, Xc, and 
Xd for corresponding interlayer patterns (overlay targets) A, B, C, and D according to an 
embodiment of the present invention. 

Fig. 2(a) is a side view illustration of a patterned top layer L2 being offset by an 
amount +F from a patterned bottom layer LI in accordance with one embodiment of the 
present invention. 

Fig. 2(b) is a side view illustration of a patterned top layer L2 being offset by an 
amount -F from a patterned bottom layer LI in accordance with one embodiment of the 
present invention. 

Fig. 2(c) is a side view illustration of a patterned top layer L2 being offset by an 
amount +F+fO from a patterned bottom layer LI in accordance with one embodiment of the 
present invention. 

Fig. 2(d) is a side view illustration of a patterned top layer L2 being offset by an 
amount -F+fO from a patterned bottom layer LI in accordance with one embodiment of the 
present invention. 

Fig. 2(e) is a side view illustration of a patterned top layer L2 being offset by an 
amount +F+fO+E from a patterned bottom layer LI in accordance with one embodiment of 
the present invention. 
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Fig. 2(f) is a side view illustration of a patterned top layer L2 being offset by an 
amount -F+fO+E from a patterned bottom layer LI in accordance with one embodiment of 
the present invention. 

Fig. 3(a) is a flow diagram illustrating a procedure for determining overlay in 
accordance with one embodiment of the present invention. 

Fig. 3(b) shows a graphical representation of an approach to determination of overlay 
according to an embodiment of the present invention. 

Fig. 4 is a diagrammatic representation of a conventional microscopic imaging 

system. 

Fig. 5(a) is diagrammatic representation of a microscopic imaging system having a 
numerical aperture (NA) optimized for scattering characteristics in accordance with a first 
embodiment of the present invention. 

Fig. 5(b) is diagrammatic representation of a microscopic imaging system having a 
numerical aperture (NA) optimized for scattering characteristics in accordance with a second 
embodiment of the present invention. 

Fig. 5(c) is diagrammatic representation of a microscopic imaging system having a 
numerical aperture (NA) optimized for scattering characteristics in accordance with a third 
embodiment of the present invention. 

Fig. 5(d) is diagrammatic representation of a microscopic imaging system having a 
numerical aperture (NA) optimized for scattering characteristics in accordance with a fourth 
embodiment of the present invention. 
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Fig, 5(e) is a top view representation of an imaging spectrometer, multiple site field- 
of-view example in accordance with one embodiment of the present invention. 

Figure 6 is a diagrammatic representation of a system for selecting one or more 
wavelength ranges in accordance with one embodiment of the present invention. 

Figure 7 is a diagrammatic representation of a simultaneous, multiple angle of 
incidence ellipsometer. 

Figure 8 is a schematic view of a spectroscopic scatterometer system in accordance 
with one embodiment of the present invention. 

Figure 9(a) shows a plurality of targets placed substantially-collinearly along either 
an X-direction or a Y-direction in accordance with a first embodiment of the present 
invention. 

Figure 9(b) shows four targets disposed collinearly along the X-dimension, and four 
targets disposed collinearly along the Y-dimension in accordance with a second embodiment 
of the present invention. 

Figure 10 shows an example incidence line and field of view for use in a technique 
for scatterometric overlay determination using an incidence line in accordance with one 
embodiment of the present invention. 

Figure 1 la is a top view representation of a first combination imaging and 
scatterometry target embodiment. 
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Figure lib is a top view representation of a second combination imaging and 
scatterometry target embodiment. 

Figure 11c is a top view representation of a third combination imaging and 
scatterometry target embodiment. 

Fig. 12 is a diagram of a combined mark, in accordance with one embodiment of the 
present invention. 

Figs. 13A - 13D show variations of a combined metrology tool, in accordance with 
several embodiments of the present invention. 

Fig. 14 is a flow diagram using a combined metrology tool, in accordance with one 
embodiment of the present invention. 
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DETAILED DESCRIPTION OF SPECIFIC EMBODIMENTS 

Reference will now be made in detail to a specific embodiment of the invention. An 
example of this embodiment is illustrated in the accompanying drawings. While the 
invention will be described in conjunction with this specific embodiment, it will be 
understood that it is not intended to limit the invention to one embodiment. On the contrary, 
it is intended to cover alternatives, modifications, and equivalents as may be included within 
the spirit and scope of the invention as defined by the appended claims. In the following 
description, numerous specific details are set forth in order to provide a thorough 
understanding of the present invention. The present invention may be practiced without 
some or all of these specific details. In other instances, well known process operations have 
not been described in detail in order not to unnecessarily obscure the present invention. 

An aspect of the present invention provides a set of four or more scatterometry 
overlay targets which have been formed on a sample, such as a semiconductor device. A 
pattern could also be described as a "pattern or interlayer pattern", with the two terms being 
synonymous under most circumstances. In a particular implementation, the sample has two 
or more layers of a semiconductor device, and the targets are utilized to provide a measure of 
the placement accuracy of various structures comprised in the device. In general, placement 
accuracy is characterized by measurement of an overlay error between two different layers 
of the semiconductor device. 

In a specific embodiment, a set of four targets are provided, and each target includes 
two sets of structures on two different layers which are offset from each other. In a specific 
implementation, an offset may be defined as the sum or the difference of two separate 
distances: a first distance F and a second distance fO, with F being greater than fO . Denoting 
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the four targets as "target A", "target B", "target C" and "target D", the corresponding 
predetermined offsets for each of these targets may be defined as follows for a specific target 
design: 

Xa = +F+fO (for target A), 

Xb = -F+fO (for target B), 

Xc = +F-fD (for target C), and 

Xd = -F-fD (for target D). 

The offsets for Xa through Xd may be any suitable value for practicing the techniques of the 
present invention so as to determine overlay. For example, Xa and Xb may have different 
values of fO than Xc and Xd. 

Fig. 1 illustrates the distribution of offsets Xa, Xb, Xc and Xd along the x axis in a 
particular implementation of the invention. As shown, offsets Xa and Xc are both positive 
with Xa being larger than Xc. In contrast, offsets Xb and Xd are both negative with Xd 
being more negative than Xb. 

The number of targets and the magnitude and sense of their corresponding offsets 
may be chosen in any suitable manner so that the techniques of the present invention may be 
practiced to determine overlay error. A specific set of targets and their corresponding offsets 
are described below in relation to Figures 2(a) through 2(f). It should be readily apparent 
that there are numerous combinations of targets and offset values which may be utilized to 
practice the techniques and utilize the systems of the present invention. 
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Fig. 2(a) is a side view illustration of a patterned top layer L2 being offset by an 
amount F from a patterned bottom layer LI in accordance with one embodiment of the 
present invention. Each layer LI and L2 is patterned into a set of structures. A structure 
may include any suitable feature, such as a line, trench or a contact. A structure may be 
designed to be similar to a semiconductor device feature. A structure may also be formed 
from a combination of different features. Further, a structure may be located on any layer of 
the sample, e.g., either above the top layer of the sample, within any layer of the sample, or 
partially or completely within a layer of the sample. In the illustrated embodiment of Fig. 
2(a), layer LI includes the complete structures 204a-c, while layer L2 includes the complete 
structures 202a-c. Construction of scatterometry overlay targets structures and methods for 
producing them are described in U.S. patent application, having application no. 09/833,084, 
filed 10 April 2001, entitled "PERIODIC PATTERNS AND TECHNIQUE TO CONTROL 
MISALIGNMENT", by Abdulhalim, et al., which application is herein incorporated by 
reference in its entirety. 

As shown, the structures of the top layer L2 are offset by an amount F from the 
structures of the bottom layer LI. The structures of the two offset layers may be located 
within adjacent layers or have any suitable number and types of layers disposed in between 
the two offset layers. Fig. 2(a) also shows three films Tl, T2, and T3 between patterned 
layers LI and L2 and their corresponding structures. To the extent that any other layers exist 
between the two layers having the structures, these other layers exhibit at least a minimum 
degree of transmission for electromagnetic radiation to permit propagation of the radiation 
between the layers having the structures. 
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Fig. 2(b) is a side view illustration of a patterned top layer L2 being offset by an 
amount -F from a patterned bottom layer LI in accordance with one embodiment of the 
present invention. Fig. 2(c) is a side view illustration of a patterned top layer L2 being offset 
by an amount +F+fO from a patterned bottom layer LI in accordance with one embodiment 
of the present invention. Fig. 2(d) is a side view illustration of a patterned top layer L2 
being offset by an amount -F+fO from a patterned bottom layer LI in accordance with one 
embodiment of the present invention. Fig. 2(e) is a side view illustration of a patterned top 
layer L2 being offset by an amount +F+fO+E from a patterned bottom layer LI in accordance 
with one embodiment of the present invention. Fig. 2(f) is a side view illustration of a 
patterned top layer L2 being offset by an amount -F+fO+E from a patterned bottom layer LI 
in accordance with one embodiment of the present invention. 

In general, an error offset E may be determined by analyzing at least the measured 
spectra from four or more targets A, B, C, and D each having offsets between two patterned 
layers, such as offsets Xa through Xd. This analysis is performed without comparing any of 
the spectra to a known or reference spectra. In other words, the error determination 
techniques of the present invention do not require a calibration operation. 

Fig. 3(a) is a flow diagram illustrating a procedure 300 for determining overlay in 
accordance with one embodiment of the present invention. In this example, four targets A, 

B, C, and D are used which are designed to have offsets Xa through Xd as described above. 
That is, target A is designed with offset +F+fO; target B with offset -F+fO; target C with 
offset +F-fO; and target D with offset -F-fO. 

Initially, an incident radiation beam is directed towards each of the four targets A, B, 

C, and D to measure four spectra S A , Sb, So and S D from the four targets in operations 302a 
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through 302d. Operations 302a through 302d may be carried out sequentially or 
simultaneously depending on the measurement system's capabilities. The incident beam 
may be any suitable form of electromagnetic radiation, such as laser or broadband radiation. 
Examples of optical systems and methods for measuring scatterometry signals to determine 
overlay may be found in (1) U.S. patent application, having Patent No. 09/849,622, filed 04 
May 2001, entitled "METHOD AND SYSTEMS FOR LITHOGRAPHY PROCESS 
CONTROL", by Lakkapragada, Suresh, et al. and (2) U.S. patent application, having 
application no. 09/833,084, filed 10 April 2001, entitled "PERIODIC PATTERNS AND 
TECHNIQUE TO CONTROL MISALIGNMENT", by Abdulhalim, et al. These 
applications are herein incorporated by reference in their entirety. 

Further embodiments of suitable measurement systems and their use for determining 
overlay error are further described below. In various embodiments of the present invention, 
the spectra S A , Sb, Sc, and So (and any additional spectra that may be present) could include 
any type of spectroscopic ellipsometry or reflectometry signals, including: tan^F), cos(A), 
Rs, Rp, R, a (spectroscopic ellipsometry "alpha" signal), P (spectroscopic ellipsometry 
"beta" signal),((Rs-Rp)/(Rs+Rp)), etc. 

Spectrum S B (-F+f0) is then subtracted from spectrum S A (+F+f0), and spectrum S D (- 
F-fO) is subtracted from spectrum S c (+F-f0) to form two difference spectra Dl and D2 in 
operations 304a and 304b, respectively. Next, a difference spectrum property PI is obtained 
from the difference spectra Dl and a difference spectrum property P2 is obtained from the 
difference spectrum D2 in operations 306a and 306b, respectively. The difference spectra 
properties PI and P2 are generally obtained from any suitable characteristic of the obtained 
difference spectra Dl and D2. The difference spectra properties PI and P2 may also each 
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simply be a point on the each difference spectra Dl or D2 at a particular wavelength. By 
way of other examples, difference spectra properties PI and P2 may be the result of an 
integration of averaging of the difference signal, equal the an average of the SE alpha signal, 
equal a weighted average which accounts for instrument sensitivity, noise or signal 
5 sensitivity to overlay. 

After difference spectra properties PI and P2 are obtained, the overlay error E may 
then be calculated directly from the difference spectra properties PI and P2 in operation 308. 
In one embodiment, a linear approximation is performed based on the difference spectra 
properties PI and P2 to determine the overlay error E, while in another technique the 

10 difference spectra properties PI and P2 are used to approximate a sine wave function or 
other periodic function which is then used to determine the overlay error E. One linear 
regression technique is illustrated below with respect to Fig. 3(b). In one example, the 
overlay result may be obtained by a statistical calculation (e.g. averaging or weighted 
averaging) of overlay results obtained from properties of multiple wavelengths or multiple 

1 5 wavelength ranges . 

In a variation of this implementation, if all four targets have the same characteristics, 
such as pitch P, thin film characteristics, structure size and composition, except for the 
offsets, and assuming that Xa and Xb are opposite in sign and have the same order of 
magnitude (0.1<Xa/Xb<10), and if 0.05<|Xa/P|<0.45 and 0.05<|Xb/P|<0.45, and if Xa is the 
20 same sign as Xc and Xb is the same sign as Xd, an estimate of the overlay error E present 
between structures within the interlayer targets can be calculated as follows using a linear 
approximation based on the difference spectra properties PI and P2: 

E' =(( S c - S D )*(Xa + Xb)/2 + (S A - S B )*(Xc + Xd)/2)/(( S A - S B )-( S c - S D )) 
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or 

E' = (P2*(Xa + Xb)/2 + Pl(Xc + Xd)/2) / (PI - P2) 

where the difference spectra properties PI and P2 are generally opposite in sign for 
overlay errors E<fO. If (Xa-Xb) = (Xc-Xd) and E=0, then Pl= -1*P2. 

Alternatively, if the same values for F and ft) are used for designing each target offset 
Xa, Xb, Xc, and Xd, then 

E' = (fO*P2 + fO*Pl)/(Pl - P2). 

The targets may be used to determine overlay of structures located at least partially in 
more than one layer, but could also be used to determine overlay of structures located 
substantially in a single layer. 

Fig. 3(b) shows a graphical representation of the linear approach for determining the 
overlay error E in accordance with one embodiment of the present invention. As shown, the 
positive portion of the y axis shows a change in the difference spectra property PI as a 
function of fO+E and the negative portion of the y axis shows a change in the difference 
spectra as a function of -fO+E. As described above, the difference spectra properties PI and 
P2 are obtained from the difference spectra Dl and D2. 

The overlay error E may be obtained by analyzing the two points (+fO+E, PI) and (- 
fO+E, P2). The overlay error E may be determined in one approach by performing a linear 
approximation with the two obtained difference spectra properties PI and P2. Note that there 
are two points on the graph where E is zero, while other portions of the graph are a function 
of the overlay error E and fO. If the offsets are chosen carefully so as to be in the linear 
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region, then the slope of the positive portion of the graph (Pl/(+fO+E)) should equal the 
slope of the negative portion of the graph (P2/(-f0+E). Thus, the overlay error is given by 
E=fO*(Pl+P2)/(Pl-P2). 

According to an implementation of the invention, if there are two targets with offsets 
5 +F and -F that are equal in magnitude but opposite in sign and no other overlay errors, then 
the Oth diffraction order scatterometry SE or reflectometry spectra are substantially identical 
from these two targets (to a good approximation) and the difference signal between the 
spectra corresponding to +F and -F is zero. Of course, any property of the difference signal 
is also zero. If one deliberately breaks the symmetry (artificially induces an overlay error) 
10 by designing an additional offset +f0, then the difference signal Dl is no longer zero and any 
suitable difference spectra property follows the same relationship as for an overlay error E. 
Similarly one can design another set of overlay targets with an additional offset -fO. Thus, 
the overlay error may be determined using a property of the difference signals Dl (+F+fO, - 
F+fO) and D2 (+F-fO, -F-fD), and accordingly no separate calibration step is required. 

15 It should be understood that when the overlay error E is calculated from the spectra 

signals, it may be an estimate of actual overlay error. The calculated overlay error E may be 
denoted as the overlay error (E), or an estimate of the overlay error (E'). 

If the pitch between structures is relatively large then the above described linear 
approximation techniques generally work well. However, when the pitch is relatively small 
20 then additional targets may be produced on the sample to improve the accuracy of the 
overlay measurements. The number of targets and corresponding scatterometry techniques 
which are used depend on the particular materials of the target and the scatterometry signal 
type implemented, among other factors. Whether to use four or more targets can be 
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determined experimentally or by well known modeling methods. In one embodiment, two 
additional interlayer targets (denoted targets "H" and "J") are produced on the sample, with 
corresponding offsets Xh and Xj. Upon being illuminated by incident radiation, the targets 
H and J produce corresponding diffracted components, which can serve as a basis for 
determination of an additional difference signal D3 and difference spectra property P3. This 
property P3 may be analyzed in connection with the difference spectra properties PI and P2 
to refine the determination of the overlay E to include non-linear corrections or 
measurements of the errors introduced by using a linear approximation. 

One alternative embodiment to the linear approximation methods discussed above is 
to treat the scatterometry overlay signal as a periodic function and use phase detection 
methods to determine the overlay error. This embodiment may be preferred in some 
circumstances, depending on variables that may include scatterometry overlay target pitch, 
scatterometry overlay target design, scatterometry overlay (SCOL) target materials, the 
measured scatterometry signal, and the like. 

The overlay error may be extracted from measuring multiple SCOL targets with pre- 
programmed additional built-in overlay offsets. (One example of the preprogrammed offsets 
could be Xa, Xb, Xc, and Xd as discussed above and in Figure 1). The number of targets 
measured may be two, three, four, or more than four, or may vary between different overlay 
measurement locations. A scatterometry signal (as a function of the wavelength or incident 
angle, for example) is acquired from the required SCOL targets. For every arbitrary overlay, 
this signal is a periodic and even function of overlay error. A phase detection (or phase 
retrieval, phase extraction, or phase determination) algorithm utilizes these properties of the 
signals. The measured signal is represented by a set of even periodic functions with a 
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corresponding number of free parameters (one of these free parameters is the overlay error 
itself). Different sets of such functions may be used, depending, for example, on the number 
of targets measured, scatterometry signal properties, target properties, and information 
required. The number of targets measured is to be greater or equal to the cumulative number 
of free unknown parameters. When several (two or more) scatterometry overlay (SCOL) 
targets (with different pre-programmed offsets) are placed in the immediate vicinity of each 
other (within 250 microns, for example), the overlay error may be assumed to be the same 
for all these targets. Each of the other free parameters can either vary or not vary from one 
SCOL target location to the other one (within the field and/or across the wafer). (Overlay is 
assumed to vary between different overlay measurement locations). Alternatively, these free 
parameters (or some of them) may either vary or not vary between X- and Y- SCOL target 
orientations. Based on the information required, the measurement accuracy required and on 
whether some free parameters are not varying location-to-location and/or between X- and Y- 
orientations, the total number of SCOL targets per overlay measurement location and total 
number of SCOL targets to be measured per field and/or per wafer is determined. 

An example of a phase algorithm approach to determining overlay error from 
scatterometry signals from multiple targets is to treat the dependence of the scatterometry 
signals on overlay error as a periodic function. In this case the programmed offsets of the 
multiple targets are treated as initial phase offsets and the overlay error is treated as an 
additional phase. The overlay error can then be determined using well-known phase 
determination or phase retrieval methods. Well known phase retrieval methods that may 
include quadrature, 3 -bucket, and 4-bucket phase retrieval algorithms can be used to 
determine overlay error. These phase retrieval methods are listed as examples only and are 
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not meant to limit the scope of the invention. Phase detection methods are well known and 
are commonly used in diverse areas such as communications, interferometry, nuclear 
magnetic resonance, electronic circuits, to list a few examples. In another embodiment, a 
combination of linear, non-linear, and phase retrieval algorithms may be employed to 
determine the overlay error. 

Certain conditions are preferably met with implementation of the above described 
techniques. The measurement areas are substantially identical in all aspects except for the 
offsets, e.g., +F+fO, -F+fO, +F-fO, and -F-fO. This is likely accomplished by placing the 
targets within about 100 microns or less of each other and by choosing targets which are 
relatively-robust to the process (i.e. they have similar or less sensitivity to process variation 
as the device features). In practice, on production wafers, the profiles may deviate from 
identical for different offsets if the topography from the lower pattern layer(s) and the upper 
layer changes in response to interacting with this topography. A difference or error signal 
between the two targets with different offsets is relatively independent of profile variation of 
the overlay target segments and to film variation as long as the profiles are common to the 
different targets. This is the substantial equivalent of common mode rejection of the parts of 
the signal that are determined by the profile and the films and the optics. The technique is 
also preferably robust to the range of process variation encountered in a typical 
manufacturing process. The signal differences due to the overlay error are also preferably 
larger than the signal differences due to other sources of process variation between the 
nearby scatterometry overlay targets (including mask errors). 

If in a particular implementation the targets include structures grouped to exhibit the 
characteristics of lines, then a separate set of targets may be required for X and Y overlay 
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measurements. If the overlay targets are composed of 2-dimensional structures (as seen 
from a top down view), then it may be possible to use one set of targets to get both X and Y 
overlay information. For oblique scatterometry, according to a specific implementation, it 
may be advantageous to rotate the orientation of the wafer with respect to the optical 
scattering plane to measure the different X and Y overlay errors. For true normal incidence, 
it may be possible to get both X and Y overlay information from the different polarizations 
without rotating the wafer or the optics. 

Cartesian coordinates provide a convenient frame of reference for measuring overlay 
within a sample, with the x-y plane being located within, or substantially parallel with, a 
layer of the sample, and with the z axis being substantially perpendicular to the layers of the 
sample. The Cartesian coordinate system could be fixed with respect to the sample or could 
be rotated to reduce the complexity of the measurements. For example, overlay occurring 
diagonally across the sample but within a single layer could be described as two-dimensional 
x-y overlay in a Cartesian system with the x-y axes substantially parallel with the sides of a 
rectangular sample. That same diagonal overlay could be measured along a single axis, 
however, by rotating the x-y axes such that the x axis is parallel with the direction of the 
diagonal overlay. Three dimensional overlay could be reduced to two-dimensional overlay 
by restricting measurements within an x-y plane substantially parallel with a layer and 
ignoring any interlayer overlay occurring in the z direction. 

In one embodiment, targets include more than one predefined offset, possibly 
between different sets of structures located in two layers, or possibly between different sets 
of structures located in more than two layers. In a general case, a target may include an 
indefinite number of layers, with all or some of these layers having structures producing 
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predefined offsets. In a particular implementation, the structures in one or more underlying 
patterned layers of a target cause changes in the shape or topography of one or more upper 
layers (disposed above the underlying patterned layer(s)). In this implementation, the one or 
more upper layers may be substantially or partially opaque or absorbing, and at least part of 
the diffraction signal may arise from the topography of an upper layer, the topography 
arising at least in part from the underlying patterned layer. 

According to one embodiment, structures included in a target may be organized in 
various configurations and shapes, including, for example, lines, grids, rectangles, squares, 
curved lines, curved shapes, or combinations of the foregoing. Such configurations of 
structures may be disposed at various locations within the target, and may describe various 
angles with respect to the electromagnetic radiation incident on the target. For example, the 
sets of structures could be organized as a set of parallel lines perpendicular to the direction of 
propagation of a collimated set of radiation rays or of a beam incident on the target. In 
another case, the structures organized as a set of parallel lines could be disposed at an acute 
angle with respect to the incident radiation, possibly at an angle of 45 degrees. Such a 
configuration could be advantageous by facilitating determination of overlay in both x and y 
directions, thereby reducing the need for additional overlay patterns or measurements. 

1 . Scatterometrv System Embodiments and Use of Same 

Several of the techniques of the present invention may be implemented using any 
suitable combination of software and/or hardware system. For example, the techniques may 
be implemented within an overlay metrology tool. Preferably, such metrology tool is 
integrated with a computer system which implements many of the operations of this 
invention. Such composite system preferably includes at least a scatterometry module for 
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obtaining scatterometry signals of the overlay targets, and a processor configured to analyze 
the obtained scatterometry signals to thereby determine overlay error within such targets. At 
a minimum, the scatterometry module will usually include (i) a source of illumination 
oriented to direct radiation onto a specified location of the sample; and (ii) one or more 
detectors oriented to detect a scatterometry signal which has been scattered by the sample. 

At least a portion of the techniques of the present invention may also be implemented 
in an overlay metrology system as an additional overlay measurement capability which 
complements an overlay measurement system or sub-system based on image analysis such 
as one used for conventional box-in-box or frame-in-frame overlay targets or other imaging 
type overlay measurement structures. Examples of apparatus which combine imaging-based 
overlay metrology and scatterometry-based overlay are described in the above referenced 
Provisional Application No. 60/498,524, which is incorporated here by reference. Overlay 
data from imaging overlay measurements and scatterometry overlay measurements may be 
combined for various uses including: calculating the overlay correctables, calculating other 
overlay corrections, calculating overlay errors at other locations on the wafer. More use 
cases for combinations of imaging overlay metrology and scatterometry overlay metrology 
are also described in above referenced Provisional Application No. 60/498,524. 

Regardless of the system's configuration, it may employ one or more memories or 
memory modules configured to store data, program instructions for the general-purpose 
inspection operations and/or the inventive techniques described herein. The program 
instructions may control the operation of an operating system and/or one or more 
applications. The memory or memories may also be configured to store scatterometry data 
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obtained from the targets and overlay error results and optionally other overlay measurement 
data. 

Because such information and program instructions may be employed to implement 
the systems/methods described herein, embodiments of the present invention relates to 
5 machine readable media that include program instructions, state information, etc. for 
performing various operations described herein. Examples of machine-readable media 
include, but are not limited to, magnetic media such as hard disks, floppy disks, and 
magnetic tape; optical media such as CD-ROM disks; magneto-optical media such as 
floptical disks; and hardware devices that are specially configured to store and perform 
10 program instructions, such as read-only memory devices (ROM) and random access memory 
(RAM). The invention may also be embodied in a carrier wave traveling over an appropriate 
medium such as airwaves, optical lines, electric lines, etc. Examples of program instructions 
include both machine code, such as produced by a compiler, and files containing higher level 
code that may be executed by the computer using an interpreter. 

15 Several of the system embodiments described below are mainly described and 

illustrated with respect to a scatterometry module or components for obtaining spectra from 
four or more targets, while the processor and memory are not shown. 

Imaging Metrology Systems in Which the Numerical Aperture is Optimized for 
Measurement of Scattering Structures 

20 Fig. 4 is a diagrammatic representation of a microscopic imaging system. As shown, 

the imaging system 400 includes a beam generator 402 for producing an incident beam 403 
of electromagnetic radiation, a beam splitter 404 for directing the incident beam 405 towards 
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the sample 408. Typically, the incident beam is focused onto the sample by an objective 
lens 406. An output beam 409 is then emitted from the sample in response to the incident 
beam and passed through the beam splitter 404 through relay lens 410 onto imager or camera 
412. The camera 412 generates an image of the sample based on the output beam 409. 

5 The system 400 also includes a processor and one or more memory 414 which are 

configured to control the various components, such as the beam generator 402, objective lens 
406, and cameral 412. The processor and memory are also configured to analyze the 
detected output beam or image implementing the various scatterometry techniques described 
above. 

10 Traditionally, such imaging systems (such as those used for overlay) have selected 

numerical apertures (NA's) (e.g., via objective lens 406) to optimize image resolution and to 
minimize optical aberrations. Selection of NA is typically performed in order to derive the 
overlay information from the variation in intensity over a single target (such as a box-in-box 
target ) from the geometrical properties of the image. 

15 Conventional imaging systems have relied upon high numerical apertures (NA's), 

such as 0.7 to 0.9, but doing so results in an expensive optical system which is sensitive to 
vibration, depth of focus, and optical aberrations. These issues reduce the achievable 
precision and cause a measurement error referred to as "tool induced shift" or TIS. 

Scatterometry systems may take measurements at multiple sites sequentially in order 
20 to measure both x and y overlay and to eliminate effects due to variations in other sample 
parameters, such as film thickness. This type of measurement process results in significantly 
slower operation of the scatterometry tool relative to conventional overlay techniques. 
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In one embodiment of the present invention, the illumination and imaging NA's of an 
imaging optical system are chosen to optimize the performance of the instrument on 
scattering structures by ensuring that only the zero'th diffraction order is collected. One may 
take advantage of the fact that there exist performance advantages for certain metrology or 
inspection tasks pertaining to periodic structures when only the zero order diffraction is 
collected by the detection system. Under this condition, only the specular reflection is 
collected. Since the output which is scattered out of the specular is not collected and the 
nonspecular output may be more sensitive to aberrations, collection of only the specular 
output tends to minimize effects caused by optics aberrations. This condition also will result 
in a tool which will be optimized for relative photometric measurements of multiple sites in 
the field of view as described further below. Very low TIS may also be achieved, as 
compared to conventional imaging systems. Much higher throughput may also be achieved 
than with conventional scatterometry systems. 

Choosing the illumination and imaging NA's for a particular imaging system is based 
on the particular configuration of such system. If we now consider the simplest imaging 
system in which the numerical aperture NA of illumination and collection are the same and 
the incident beam in normal to the sample surface, then the condition of "zero order 
diffraction only" can be met if: 

rik > 2dNA , where n =1 . 

where d is the pitch of the structures of the targets being imaged. This can be restated in 
terms of the numerical apertures of illumination NAt and collection NA C of the imaging 
system as: 
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nX = d(NA t + NA C ) 

This equation indicates that if we are able to constrain the numerical aperture of the 
illumination system, we can relax the constraint on the numerical aperture of the collection 
optics, which may be advantageous under certain conditions. Thus, the spectral range may 
be restricted to wavelengths greater than twice the product of the pitch and the NA. Under 
realistic conditions the scattered radiation beam will be wider (more divergent) than the 
illumination beam. Under realistic circumstances however, infinitely periodic gratings are 
not imaged and so the above equations become approximations and the diffracted plane 
waves become somewhat divergent. So it may be preferable to include a margin of safety in 
the constraint and require that: 

nX > 2dNA(l + s) , where n =1 and s is small, typically less than 0.5. 

As an example, for an NA 0.4 imaging system, wavelengths may be restricted to 
values greater than 0.8 times the largest pitch, which does not seem to be an unreasonable 
constraint. For periodic structures having features of design rule 70 nm and below, the 
densest structures with pitches as low as 200 nm does not constrain the spectral range of 
imaging systems with operating wavelengths equal to about 200 nm or longer, while more 
isolated features with pitches as large as 500 nm are preferably measured with wavelengths 
longer than 400 nm. 

It is preferable to account for these constraints when designing an imaging 
spectrometer for metrology and inspection applications. A limit on the spatial resolution of 
the imaging system is the numerical aperture of the system. It is advantageous to achieve the 
highest spatial resolution so as to be able to shrink to a minimum the size of metrology 
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structures and conserve valuable wafer real estate. Restated, this allows minimization of 
proximity effects or "crosstalk" between adjacent features in the field of view of the imaging 
spectrometer. Therefore, the highest possible NA is achieved while meeting the constraint 
that the zero order diffraction is collected by the detection system. 

Another interesting outcome of this constraint is that the highest possible overlay 
spatial resolution may be achieved without ever resolving the features under test. This may 
have further advantages as it should ensure that problematic aliasing phenomena are avoided 
in the imaging system. In a preferred embodiment, an architecture is provided in which the 
spectral band pass can be easily modified or selected by the measurement system or 
algorithm based on the largest pitch in the feature under test (e.g., as the systems in Figures 
5A through 5D described further below). Alternatively, the NA of either illumination or 
collection could be easily modified, depending on the largest pitch in the feature under test. 
Alternatively, all of these embodiments may be implemented within a single system. 

Figs. 5A through 5E illustrate four embodiments of microscopic imaging systems 
having a numerical aperture (NA) optimized for scattering characteristics. As shown in 
Figure 5A, the system 500 may have components which operate like the same named 
components of the system in Figure 4. The system 500 further includes a wavelength 
selection device 520 for selecting a particular wavelength. The wavelength selection device 
520 allows modification of the spectral band by selectively transmitting or selectively 
reflecting part or portions of the illuminating radiation. A wide variety of well known 
spectroscopic filtering techniques may be employed to modify the spectra band, including 
selecting from a set of band pass interference filters, continuously varying bandpass 
interference filters, grating based spectrometers, Fourier transform interferometers, acousto- 
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optic tunable filters, to name a few. The wavelength selection device 520 is positioned 
within the incident beam path between the beam. The system 500 may also include a 
polarizer control device 522 for causing the incident beam to be in a particular polarization 
state and a polarization analyzer 524 for analyzing or separating out the polar components of 
the collected beam. 

The system 530 of Figure 5B is similar to the system 500 of Figure 5A, except a 
wavelength modulation device 532 is used in place of a wavelength selection device. The 
system 540 of Figure 5C is similar to the system 500 of Figure 5 A, except the wavelength 
selection device 542 is positioned in the output beam path. The system 550 of Figure 5D is 
similar to the system 500 of Figure 5C, except a wavelength modulation device 532 is used 
in place of a wavelength selection device. The wavelength modulation device 532 can be 
controlled by changing one or more optical path lengths in the wavelength modulation 
device 532 itself (e.g., an interferometric system, such as in a Michelson, Fabry-Perot, or 
Sagnac interferometers). The spectral information may be derived from the resulting signal 
with a transform analysis like a Fourier transform or Hadamard transform, for example. 

Figure 5E is a top view representation of an imaging spectrometer, multiple site 
field-of-view example in accordance with one embodiment of the present invention. In one 
implementation, spectra from pixels in each dotted box are averaged to create a spectrum for 
each of the four measurement targets. Alternatively, spectra from pixels located only in a 
central region of each dotted box are averaged together. Size and spacing of lines in the 
illustrated targets are exaggerated for emphasis. 

The NA may be selected to ensure that only the zero'th diffraction order is collected 
in any suitable manner. In one proposed operational embodiment: 
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1 . Two or more sites of differing characteristics are located in the field of view 
of the imaging system. 

2. Images are captured over one or more spectral ranges. 

3. For each measurement site in the field of view, all or some of the pixels 
determined to be within that site are summed or otherwise combined to 
characterize the photometric properties of that site in that spectral range. 

4. Step 3 is repeated for each spectral range. 

5. The results for each site over each spectral range are processed to determine 
the properties of the sample. For example, the above described spectral 
analysis techniques (i.e., F+fO) are used on the obtained spectra for each 
target. 

6. Steps 1 though 5 are repeated for the plurality of measurement sites desired 
across the wafer. 

While this example technique describes sequentially capturing images over different 
spectral regions, this could be accomplished simultaneously using a system of wavelength 
dependent beam splitters, filters, and/or mirrors. Alternatively, the same could be affected 
by using a device such as a Sagnac interferometer which captures multiple images at 
different optical path differences, these being used to derive information equivalent to 
images taken over different spectral ranges. 

Scatterometric Overlay Using Filters 
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Conventional imaging overlay tools have a high magnification and small field of 
view. Inspection for gross patterning defects is either done manually on a microscope or 
automatically on a separate macro inspection tool. A low magnification overlay tool 
unfortunately requires multiple steps or tools, some of which are manual. 

In one embodiment, a low magnification microscope with a mechanism for selecting 
one or more wavelength ranges is provided. This tool also preferably uses one or more 
broadband sources with filters, with multiple sources covering different wavelength ranges, 
variable filters, etc. Figure 6 is a diagrammatic representation of a system 600 for selecting 
one or more wavelength ranges in accordance with one embodiment of the present invention. 
As shown, the system 600 includes a broadband source 602 for generating a multiple 
wavelength incident light beam 604 towards sample 606. A multiple wavelength output 
beam 608 is scattered from the sample 606 in response to the incident beam 604. The 
system 600 also includes a filter 610 for selectively passing a portion of the output beam 611 
based on wavelength to camera 612. In one implementation, the filter is configurable to pass 
particular colors, such as red, green, blue, or yellow. The camera is operable to generate an 
image based on the filtered output beam 611. 

Measurements of overlay are taken by moving to a location on the sample where one 
or more of the targets in a target set are in the field of view of the microscope. An image is 
acquired and the intensity from some or all of the pixels in the image, which includes each 
individual target, are averaged or summed to give an intensity value for the target at a 
particular setting of the filter. In one embodiment, the filter is adjusted so as to give a 
maximum difference between targets. This may subsequently be normalized with respect to 
a reference surface, to the number of pixels summed, or corrected by a map of the 
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illumination uniformity within the field of view. The sample or optics may then be moved 
until all of the necessary targets in a target set are measured. The overlay value is then 
determined using the intensity values thus determined, as described above, e.g., by: 

PI =( la -lb) andP2 = (Ic-Id) 

5 And : 

Overlay = £0 * ( P2 + PI ) / ( P2 - PI ) 

This process could be repeated over multiple wavelength ranges to improve accuracy, 
precision, and robustness, wherein the wavelengths which result in the best contrast are used 
for the scatterometry analysis. 

10 Because the magnification is low and the field of view is large compared to a typical 

imaging overlay tool and because an image of the area of the sample is colleted, unlike a 
conventional reflectometer or ellipsometer, analysis of the image may be used to detect other 
types of processing problems by analyzing the image. For example, if the wrong reticle has 
been used for one or more processing steps, the image would be materially different. If the 

15 resist thickness were incorrect, the brightness or contrast of the image may be effected. If 
resist streaking were present, variation of brightness or contrast over the image may be 
detected. In CMP (chemical mechanical polishing) processes, processing errors such as 
over-polish, under-polish, etc. could be similarly detected. 

In this embodiment, multiple scatterometry targets can be measured simultaneously, 
20 increasing the measurement speed. Additionally, processing errors or changes in processing 
conditions other than overlay can be detected without the need for a separate inspection tool. 
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Multi-Angle, Simultaneous Scatterometry 

Techniques of obtaining scatterometric measurements may include the 2-theta 
approach, in which scattering intensity from a grating or other repeating structure is 
measured at a plurality of angles by making multiple, sequential measurements. Another 
technique of making scatterometric measurements is spectroscopic scatterometry. Use of the 
2-theta approach is very slow, since multiple measurements are typically made. Use of 
spectroscopic scatterometry requires sophisticated expensive optics. 

In a specific embodiment of the present invention, techniques and apparatus for 
simultaneous, multi-angle scatterometry are provided. Unlike the 2-theta approach, 
measurements are made with an apparatus which permits scattering intensity to be 
simultaneously determined for many angles. This technique is far faster than the 2-theta 
approach. 

In order to implement this approach, an optical apparatus such as that shown in US 
Patent 5,166,752 by Spanier et al could be used. This patent is herein incorporated by 
reference in its entirety. In this patent, a multi-angle ellipsometer is shown in, for example, 
Figures 3 and 4 of the Spanier et al. patent. Figure 7 is a diagrammatic representation of a 
simultaneous, multiple angle of incidence ellipsometer 700. As shown, the ellipsometer 
includes a source generator (e.g., components 702, 706, 708, 710, and 712) for directing 
polarized light onto the surface of sample 714, detection optics (e.g., components 718 
through 724) for handling and detecting the output beam emitted from the sample, and an 
analyzer 726 for analyzing the polarization state of the light reflected from the sample. The 
source generator includes a light source 702, a polarizer 708, a compensator 710 with a 
variable aperture, and a focusing lens system 712 simultaneously directing polarized light 
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from a single beam of light from the light source onto the sample's surface at different 
angles of incidence. The source generator may also include an optional optical narrow band 
filter. 

The lens system 712 has an effective aperture to focal length ratio for focusing the 
5 light on the sample 714 with angles of incidence which vary over a range of angles of at 
least one or two degrees. In a particular embodiment, the range of angles of incidence is 30 
degrees. Larger angles could be employed for directing rays at the sample 714. 

The focusing lens system 712 focuses the polarized light which may be from a He- 
Ne laser for example, down to a single small spot or point on the sample 714. The different 

10 incident rays may have widely varying angles of incidence which are focused on a single, 
small spot on the sample 714. Thus, the light directed on the small spot on sample 714 
contains rays at many angles of incidence above and below the angle of incidence of the 
central ray through the focusing lens. Each one of the incoming rays is reflected at an angle 
equal to its angle of incidence with the polarization state of each of the rays being altered by 

15 that reflection. A detector array 722 is employed to detect a plurality of rays reflected from 
the sample 714 individually over different, narrow ranges of angles of incidence to simply 
and quickly obtain data at a plurality of angles of incidence. 

The output beam emitted from the sample 714 is directed through output lens 716, 
interchangeable aperture 718, polarization analyzer 720, and an optional alternate filter 722 
20 onto detector array 722. The diameter d of the lenses 712 and 716 corresponds to their 
effective diameter. In the illustrated embodiment the lenses 712 and 716 each have a 
diameter d of 18 mm and a focal length 1 of 34 mm. Other effective lens diameters and 
focal lengths could be employed so long as a range of angles of incidence, preferably at least 
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30 degrees, is provided. The lens diameter and focal length are chosen with a view toward 
maximizing the number of angles of incidence of the light beams which strike the sample 
714. In an alternative embodiment, light is transmitted through the sample rather than 
reflected from a surface of the sample. 

The refocusing lens or lenses 712 directs the reflected (transmitted) light toward the 
detector array 722. However, a refocusing lens need not be employed as the reflected 
(transmitted) light could be made to directly impinge upon an array of detectors. It is 
preferable that the lenses 712 and 716 do not themselves alter the polarization state of the 
light. 

The detector array 722 may be a linear, multiple element detector wherein each of the 
detector elements can detect a narrow range of angles of incidence of the rays that illuminate 
the sample. In the disclosed embodiment, the array 722 is a solid-state photosensitive 
detector array wherein the separate detector elements are all integrated on one circuit chip. 
Particularly, the detector elements comprise a linear array of photodiodes. While integrated 
on a single circuit chip, the individual photodiodes can function as separate detectors. The 
linear array of the disclosed embodiment comprises 128 detector elements arranged in a row 
to provide data for 128 different angles of incidence where the full array is illuminated by 
the reflected (transmitted) light. The number of individual detector elements could be more 
or less than that in the disclosed embodiment and the detector elements need not be 
integrated on a single chip but could be discrete detectors. By using a plurality of detector 
elements, it is possible to simultaneously detect the light reflected from the surface (or 
transmitted through the sample) for each of a plurality of different angles of incidence. It is 
also possible with the invention to employ a smaller number of detector elements which 
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could be sequentially moved to mechanically scan the reflected (transmitted) rays for 
detection but this technique would require more time and could be less accurate, depending 
upon positioning accuracy. 

The physical size of each of the detector elements is preferably less than the expanse 
of the reflected rays so that each element detects only a certain narrow range of angles of 
incidence on the illuminating side. The output of each of the detectors is used in a 
conventional manner as with real time computer techniques {e.g., via analyzer 724) to 
generate data in terms of A and *F for each of those narrow ranges of angles of incidence. 
The data is then interpreted in a conventional manner. It matters in general which direction 
the linear array runs; the linear array preferably runs in the plane of the optical system. In 
the disclosed embodiment, the long axis of the linear detector array 722 lies in the plane of 
incidence of the central ray and perpendicular to the central ray for detecting the maximum 
number of incidence angles. 

Such an ellipsometer could be used to illuminate a scatterometry target 
simultaneously over a range of angles, and an intensity of the scattered light is measured 
over a range of angles simultaneously with an array detector or the like. By collecting data 
from the intensities measured at those angles, the parameters of the grating or other target 
can be determined. For example, the data can be compared against theoretical models of 
data derived from techniques such as those mentioned by U.S. Patent No. 6,590,656, issued 
July 8, 2003, entitled "SPECTROSCOPIC SCATTEROMETER SYSTEM" by Xu et al, 
which patent is herein incorporated by reference in its entirety. The data can be compared to 
theoretical models derived from techniques such as those mentioned by U.S. patent 
application, having application no. 09/833,084, filed 10 April 2001, entitled "PERIODIC 
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PATTERNS AND TECHNIQUE TO CONTROL MISALIGNMENT", by Abdulhalim, et 
al., which application is herein incorporated by reference in its entirety. 

The data can be pre-generated and stored in libraries, or generated in real time during 
analysis. It is also possible, for techniques like scatterometric overlay, to directly compare 
5 measured spectra associated with various targets. Such differential measurements can then 
be used to determine overlay misregistration. 

It would also be possible to perform this technique with a beam profile reflectometer 
such as that described in US Patent No. 4,999,014, issued 12 March 1991, entitled 
"METHOD AND APPARATUS FOR MEASURING THICKNESS OF THIN FILMS" by 
10 Gold et al., which patent is incorporated herein by reference in its entirety. 

Simultaneous Ellipsometrv and Reflectometry 

A system for employing a combination of ellipsometers and reflectometers may be 
employed to improve the accuracy of scatterometric measurements of overlay. In one 

15 embodiment, two or more ellipsometers are utilized as scatterometers to measure overlay. 
One of more of these ellipsometers could be spectroscopic ellipsometers. In another 
embodiment, two or more reflectometers are utilized as scatterometers to measure overlay. 
One of more of these reflectometers could be polarized reflectometers. Alternatively, a 
combination of one or more ellipsometers and one or more reflectometers are utilized as 

20 scatterometers to measure overlay. 

Measurements can be performed serially (with each tool performing measurements at 
different times), in parallel (with all tools performing measurements substantially- 
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simultaneously, or in any other arrangement (e.g., at least two but less than all of the tools 
performing measurements substantially-simultaneously). 

In any of the implementations described herein, various tools may perform 
measurements at different angles of incidence, including normally and obliquely. In a 
5 specific implementation, at least two tools perform scatterometric measurements at 
substantially the same angle of incidence but from different directions. For instance, a first 
tool would be used for scatterometric measurements in the x direction, and a second tool 
would be used for scatterometric measurements in the y direction. Such a system could 
eliminate certain common scattered signals, with a corresponding increase in accuracy of 
10 measurements, and provide a symmetric configuration. 

An advantage of employing a combination of such tools in scatterometric 
determination of overlay is that the accuracy of the measurements could be increased. 
Another advantage in using more than one tool and performing measurements at more than 
one angle (or point) of incidence is to help separate effects affecting the medium of interest 

15 (e.g., film effects) from overlay. A further advantage is that combinations of ellipsometers 
and reflectometers already exist in current inspection tools. Another advantage of employing 
a combination of scatterometers configured to perform scatterometry measurements 
substantially in parallel on different targets or different target sections could be to reduce the 
total time required for measurement. Another advantage of a parallel measurement system 

20 could be to increase the signal acquisition time for each scatterometry overlay target and 
improve the measurement precision. 

Scatterometric Overlay Determination Using FT processing 
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A system for scatterometric measurement of overlay using fourier transform (FT) 
processing may also be utilized. In one embodiment, an interferometer is employed to 
modulate substantially all wavelengths of a broadband source, and the scattered radiation is 
detected with a CCD camera. Substantially all wavelengths of the modulation band are 
recorded for each pixel, or for groups of pixels. As the interferometer steps through the 
modulation band, a spectroscopic image of the scattered signal is produced. 

The resulting spectroscopic image may have a relatively large field of view. For 
example, the image may include several multiple targets. The spectroscopic image could be 
processed on a pixel-by-pixel basis to accurately determine overlay while eliminating 
extraneous effects (e.g., film effects). Alternatively, processing could be performed using 
groups of pixels to improve speed and decrease processing resources. For example, a group 
of pixels from each target may be analyzed in accordance with the above described 
scatterometry process. In one implementation, the images for each corresponding pair of 
targets is subtracted to obtain difference images Dl and D2. A characteristic, such as 
average intensity, of each difference signal is then obtained to result in PI and P2, which are 
then used to determine the overlay error. 

In a specific implementation, a Michelson interferometer is used to step through a 
wavelength modulation band. Alternatively, a Linnik interferometer, or any other 
interferometer, could be employed. For each position of the mirror, a CCD camera records 
the scattered signal intercepted in the field of view of the camera. The detected signals may 
then be digitized and stored on a pixel-by-pixel basis, or as groups of pixels. The magnitude 
of the steps is generally proportional with the accuracy of the overlay measurement. The 
speed of the camera (e.g., the number of fields per second that the camera can capture) is 
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typically proportional with the speed of the measurement. Once the modulation band is 
spanned, the signal recorded for each pixel (or group of pixels) may be used as a basis for a 
discrete fourier transformation (or DFT). The DFT provides a spectral profile for each pixel 
(or group of pixels). This spectral profile for each target may then be used in the 
scatterometry overlay techniques described in the previous paragraph. Overlay 
determination can then be performed with increased accuracy. 

Multiple Tunable Lasers 

A system which has a combination of tunable lasers may be utilized to improve the 
accuracy of scatterometric measurements of overlay in combination with measurements 
performed by various configurations of ellipsometers and reflectometers. The tunable lasers 
provide radiation incident on the surface of interest. In one embodiment, scatterometric 
overlay measurements are performed using targets disposed in at least one layer of the 
design under consideration, and the tunable lasers provide radiation beams incident on the 
targets. 

The measured signals may then be averaged together before or after processing. In 
one embodiment, measured radiation beams are obtained from targets A, B, C, and D. Two 
difference signals Dl an D2 from each pair of targets may then be obtained at multiple 
tunable laser settings. The signals measured from each target for each tunable laser setting 
may be averaged together prior to obtaining the difference signals Dl and D2. Alternatively, 
each set of differences signals for Dl and D2 may be averaged together to obtain a single 
average difference signal Dl and D2. Properties PI and P2 of the difference signals Dl and 
D2 {e.g., integration) may then be obtained. In an alternative embodiment, multiple 
properties PI and P2 are obtained for the different configurations of the tunable laser 
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(without averaging the measured signals or the difference signals Dl and D2) and the results 
are averaged for each signal PI and P2. The overlay error may then be obtained based on 
the signals PI and P2 as described above. 

Scatterometric Overlay Determination Using Spatial Filtering 

One embodiment expands on the above described embodiment for Scatterometric 
Overlay Determination using FT Processing. 

A system for scatterometric measurement of overlay using FT processing in 
connection with spatial filtering is provided. More particularly, the signal reflected by at 
least one scatterometry target is selectively filtered spatially to only process particular signal 
components. 

In the above described embodiment for Scatterometric Overlay Determination using 
FT Processing, an interferometer is employed to modulate substantially all wavelengths of a 
broadband source, and the scattered radiation is detected with a detector, such as a CCD 
camera. Substantially all wavelengths may then be recorded for each pixel, or for groups of 
pixels. As the interferometer steps through the modulation band, a spectroscopic image of 
the scattered signal is produced. In the present example, where the scattered signal 
corresponding to a complete image (or a portion of an image) is collected, only a portion of 
the signal corresponding to a single line of pixels is retained. Alternatively, a portion of the 
signal corresponding to a plurality of pixel lines, but less than the whole image, is collected. 
Such a selective collection of the scattered signal can be achieved by spatially filtering the 
signal to only retain horizontal, vertical or oblique stripes of the signal corresponding to 
rows of pixels in the detector or CCD camera. Alternatively, a larger, more complete portion 
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of the scattered signal could be collected at the CCD camera, but the information 
corresponding to undesirable rows of pixels (e.g., an edge of a target or a border between 
two targets) may be discarded subsequent to the collection. 

The spectroscopic image corresponding to the retained signal may then be processed 
on a pixel-by-pixel basis to accurately determine overlay while eliminating extraneous 
effects (e.g., film effects). Alternatively, processing could be performed using groups of 
pixels to improve speed and decrease processing resources. This embodiment provides 
higher SNR over conventional processing methods. 

In one implementation of the invention, the above described techniques to determine 
overlay in reference to the Scatterometric Overlay Determination using FT Processing 
embodiment may be used. 

Compared to the embodiment for Scatterometric Overlay Determination using FT 
Processing, aspects of the embodiment for Scatterometric Overlay Determination Using 
Spatial Filtering may improve the processing speed and the throughput, while decreasing 
processing resources. 

Examples of Spectroscopic Ellipsometers and Spectroscopic Reflectometers 

Figure 8 is a schematic view of a spectroscopic scatterometer system 800, in 
accordance with one embodiment of the present invention. The system 800 combines the 
features of a spectroscopic ellipsometer 802 and spectroscopic reflectometer 804, each of 
which may be used for measuring overlay of a grating structure 806 disposed on a substrate 
or wafer 808. The grating structure 806, which is shown in a somewhat simplified format in 
the Figure, may be widely varied. The grating structure 806 may, for example, correspond 
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to any of those grating structures described herein. Both the spectroscopic ellipsometer 802 
and spectroscopic reflectometer 804 may utilize a stage 810, which is used for moving the 
substrate 808 in the horizontal xy directions as well as the vertical z direction. The stage 
may also rotate or tilt the substrate. In operation, the stage 810 moves the substrate 808 so 
5 that the grating structure 806 can be measured by the spectroscopic ellipsometer 802 and/or 
the spectroscopic reflectometer 804. 

The spectroscopic ellipsometer 802 and spectroscopic reflectometer 804 also utilize 
one or more broadband radiation sources 812. By way of example, the light source 812 may 
supply electromagnetic radiation having wavelengths in the range of at least 230 to 800 nm. 
10 Examples of broadband light sources include deuterium discharge lamps, xenon arc lamps, 
tungsten filament lamps, quartz halogen lamps. Alternatively, one or more laser radiation 
sources may be used instead of or in combination with the broadband light source. 

In the spectroscopic reflectometer 804, a lens 814 collects and directs radiation from 
source 812 to a beam splitter 816, which reflects part of the incoming beam towards the 
15 focus lens 818, which focuses the radiation onto the substrate 808 in the vicinity of the 
grating structure 806. The light reflected by the substrate 808 is collected by the lens 818, 
passes through the beam splitter 816 to a spectrometer 820. 

The spectral components are detected and signals representing such components are 
supplied to the computer 822, which computes the overlay in a manner described above. 

20 In the spectroscopic ellipsometer 802, the light source 812 supplies light through a 

fiber optic cable 824, which randomizes the polarization and creates a uniform light source 
for illuminating the substrate 808. Upon emerging from the fiber 824, the radiation passes 
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through an optical illuminator 826 that may include a slit aperture and a focus lens (not 
shown). The light emerging from the illuminator 826 is polarized by a polarizer 828 to 
produce a polarized sampling beam 830 illuminating the substrate 808. The radiation 
emerging from the sampling beam 830 reflects off of the substrate 808 and passes through an 
analyzer 832 to a spectrometer 834. The spectral components of the reflected radiation are 
detected and signals representing such components are supplied to the computer 822, which 
computes the overlay in a manner described above. 

In the spectroscopic ellipsometer 802, either the polarizer 828 or the analyzer 832 or 
both may include a waveplate, also known as compensator or retarder (not shown). The 
waveplate changes the relative phase between two polarizations so as to change linearly 
polarized light to elliptically polarized light or vice versa. 

In order to collect more information about the interaction of the incident polarized 
light 830 with the sample, it is desirable to modulate the polarization state of the light or 
modulate the polarization sensitivity of the analyzer or both. Typically this is done by 
rotating an optical element within the polarizer and/or analyzer. A polarizing element within 
the polarizer or analyzer may be rotated, or, if at least one of those assemblies contains a 
waveplate, the waveplate may be rotated. The rotation may be controlled by the computer 
822 in a manner known to those skilled in the art. Although the use of a rotating element 
may work well, it may limit the system 802. As should be appreciated, the use of rotating 
elements may be slow, and because there are moving parts they tend to be less reliable. 

In accordance with one embodiment, therefore, the polarizer 828 is configured to 
include a polarization modulator 836, such as photoelastic modulator (PEM), in order to 
produce a fast and reliable spectroscopic ellipsometer. The polarization modulator replaces 
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the rotating waveplate. The polarization modulator 836 is an optical element that performs 
the same function as a rotating waveplate, but without the costly speed and reliability 
problems. The polarization modulator 836 allows electrical modulation of the phase of the 
light without mechanically rotating any optical components. Modulation frequencies as high 
5 as 100kHz are readily attainable. 

In an alternative embodiment, the analyzer 832 is configured to include a polarization 
modulator such as a PEM (Photoelastic Modulator) that can be modulated electrically. In 
yet another embodiment, both the polarizer and analyzer contain polarization modulators, 
such as PEMs, that are modulated at different frequencies. 

10 Because the polarization modulator 836 can modulate at such a high frequency, the 

polarization modulator 836 may be used to perform various techniques, which would 
otherwise be too slow. For example, the difference between the polarized reflectivity of two 
structures may be obtained. To do this, a PEM may be combined with an acoustic optical 
modulator (AOM), where the AOM rapidly moves between the two structures while 

15 modulating the polarization state at a different (but related, such as multiple or submultiple) 
frequency. Signals at the sum and the difference of the PEM and AOM modulation 
frequencies contain useful information and can be detected with high signal-to-noise by 
synchronous detection. Alternatively the AOM on the incident beam could be used in 
combination with a PEM in the analyzer. 

20 Although not shown, the rotating waveplate may also be replaced by a polarization 

modulator in other types of scatterometric systems as for example a polarization sensitive 
reflectometer. 
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Scatterometric Overlay Database 

One aspect of the present invention provides a database of scatterometric overlay 
information that may be utilized for scatterometic overlay determination. 

In one implementation, one or more database are provided which include one or 
more libraries of overlay information. The database information is then used in overlay 
measurements. 

In one implementation, the libraries are compiled using predetermined test patterns 
with artificially-induced overlay. Alternatively, the libraries are produced using layer 
misregistrations programmed into the stepper. In another embodiment, the overlay that is 
induced or programmed has a progressive characteristic, varying within a particular range. 

The information stored into the database may include overlay data regarding the 
actual overlay printed on the wafer, as induced via the test pattern or by the stepper. 
Alternatively, this information is obtained from overlay actually measured on samples. The 
database may further store scatterometry measurement records associated with the overlay 
data. Such scatterometry measurement records may be obtained by performing actual 
scatterometric measurements of the overlay data. The database may also include 
information regarding materials, process conditions, optical parameters, and other relevant 
data. The database information may be further enhanced by interpolation and other 
preprocessing. 

The scatterometry database information may be utilized to improve the accuracy and 
speed of overlay measurements by retrieving scatterometry data associated with particular 
scatterometric measurements and process conditions recorded during actual measurements. 
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Dynamic selection of a measurement algorithm or methods may also be provided based on 
database lookups. A further implementation utilizes the database to calibrate scatterometric 
overlay measurement tools before or during production line measurements. 

Alternative Systems for Performing Scatterometry 

According to an embodiment of the invention, acquisition of the spectra A through D 
(and of additional spectra if present) is performed using an optical apparatus that may 
comprise any of the following or any combination of the following apparatus: an imaging 
reflectometer, an imaging spectroscopic reflectometer, a polarized spectroscopic imaging 
reflectometer, a scanning reflectometer system, a system with two or more reflectometers 
capable of parallel data acquisition, a system with two or more spectroscopic reflectometers 
capable of parallel data acquisition, a system with two or more polarized spectroscopic 
reflectometers capable of parallel data acquisition, a system with two or more polarized 
spectroscopic reflectometers capable of serial data acquisition without moving the wafer 
stage or moving any optical elements or the reflectometer stage, imaging spectrometers, 
imaging system with wavelength filter, imaging system with long-pass wavelength filter, 
imaging system with short-pass wavelength filter, imaging system without wavelength filter, 
interferometric imaging system (e.g. Linnik microscope, e.g. Linnik microscope as 
implemented in the KLA-Tencor overlay measurements tools models 5100, 5200, 5300, 
Archer 10, etc. available from KLA-Tencor of San Jose, CA), imaging ellipsometer, imaging 
spectroscopic ellipsometer, a scanning ellipsometer system, a system with two or more 
ellipsometers capable of parallel data acquisition, a system with two or more ellipsometers 
capable of serial data acquisition without moving the wafer stage or moving any optical 
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elements or the ellipsometer stage, a Michelson interfereometer, a Mach-Zehnder 
interferometer, or a Sagnac interferometer. 

Several embodiments of an interferometer based imaging spectrometer, as well as 
other types of imaging spectrometers such as filter based or the "push broom" approach, are 
5 described in U.S. Patent, having Patent No. 5,835,214, issued 10 November 1998, entitled 
"METHOD AND APPARATUS FOR SPECTRAL ANALYSIS OF IMAGES", by Cabib et 
al. System and Method embodiments for film thickness mapping with spectral imaging are 
described in U.S. Patent, having Patent No. 5,856,871, issued 5 January 1999, entitled 
"FILM THICKNESS MAPPING USING INTERFEROMETRIC SPECTRAL IMAGING", 
10 by Cabib et al. An alternative architecture for spectral imaging based on LED illumination 
is described in U.S. Patent, having Patent No. 6,142,629, issued 7 November 2000, entitled 
"SPECTRAL IMAGING USING ILLUMINATION OF PRESELECTED SPECTRAL 
CONTENT", by Adel et al. These patents are incorporated herein by reference in their 
entirety for all purposes. 

15 The imaging spectrometer or reflectometer used for acquisition of the spectra A 

through D from the four targets (and of additional spectra if present) according to an 
embodiment of the invention may be of the Fourier transform imaging spectrometer type as 
is well understood by those skilled in the art. The imaging system of the Fourier transform 
imaging spectrometer should be capable of separating (resolving) the reflected or scattered 

20 light signals from the different targets (or sections of a compound scatterometry overlay 
target). Alternatively the imaging spectrometer or reflectometer used for acquisition of 
scatterometry overlay signals may use a two-dimension detector where one axis contains the 
spatial information from the different scatterometry overlay targets (or sections of a 

KLA1P117/P1151 49 



compound scatterometry overlay target) and the other detector axis contains spectrally 
resolved information from light spectroscopically separated with a prism system or 
diffraction grating system, for example or a system that is a combination of a prism and a 
grating. The illumination radiation may be wavelength selected prior to incidence on the 
target. 

The spectra A through D obtained from the four targets (and additional spectra if 
present) detected in the imaging spectrometers, imaging reflectometers, or any of the other 
optical systems identified above in connection with various embodiments of the present 
invention may be unpolarized or selectively polarized. One or more of the unpolarized light 
or one or more of the polarization components of the reflected or scattered light from the 
targets may be detected with the imaging spectrometer or the imaging reflectometer. 

In various implementations, separate detection systems may be used to separately or 
simultaneously record one or more of the following light signals: unpolarized reflected light, 
polarized light with the electric field substantially parallel to one major symmetry axis of 
one layer of the scatterometry overlay targets, polarized light with the electric field 
substantially perpendicular to one major symmetry axis of one layer of the scatterometry 
overlay targets, polarized light with the electric field at an angle to one major symmetry axis 
of one layer of the scatterometry overlay targets, right-hand circularly polarized radiation, 
left-hand circularly polarized radiation, and/or a combination of two or more of the 
previously listed polarization states. A separate detector system may be used to 
simultaneously record the signal from part of the light source for the purposes of light noise 
monitoring, and/or light level control, and/or light noise subtraction or normalization. 
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Various possible implementations of various embodiments of the present invention 
are illustrated in co-pending U.S. Provisional Application No. 60/449,496, filed 22 February 
2003, entitled METHOD AND SYSTEM FOR DETERMINING OVERLAY ERROR 
BASED ON SCATTEROMETRY SIGNALS ACQUIRED FROM MULTIPLE OVERLAY 
5 MEASUREMENT PATTERNS, by Walter D. Mieher et al.. This provisional application is 
herein incorporated by reference in its entirety. 

In one embodiment, each of the four targets is illuminated by radiation produced by 
an optical system. The optical system may take the form of, among others, an optical 
source, a lensing system, a focusing system, a beam shaping system, and/or a directing 

10 system. In one embodiment, the radiation illuminating at least one of the targets is shaped as 
a radiation beam, with a relatively narrow beam cross section. In a particular 
implementation, the beam is a laser beam. The radiation illuminating the targets interacts 
with structures comprised within the targets and produces diffracted radiation components 
corresponding to each target and denoted as S A , S B , Sc, and So- In one embodiment, the 

15 illuminating beam is a broadband polarized beam comprising a broad spectral range as is 
commonly used in spectroscopic ellipsometry. 

2. Scatterometry Overlay Technique Alternatives: 

Several related techniques are described in the above related co-pending U.S. 
Provisional Applications. These related techniques may be easily integrated with the 
20 techniques described herein. 

In one embodiment of the invention, the targets (or compound scatterometry target 
sections) with different programmed offsets +/-F and +/-f0 as described above, or +/-F or 
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other similar target combinations, are grouped together to enable simultaneous signal 
acquisition. In one implementation targets are arranged in a line to enable data acquisition 
while scanning the wafer or some or all of the optics in one direction along the array of 
scatterometry overlay targets. Arranging the targets in a linear array may also enable use of 
an imaging spectrometer or reflectometer, where one detector axis separates the signals from 
the different targets (or target sections) and the other detector axis detects the spectral 
information. In this case the imaging system images a linear or cylindrical image of the 
linear target array into the prism or grating system. The imaging spectrometer or imaging 
reflectometer may contain an array of two or more lenses (known to those skilled in the art 
as a lenslet array) to separate and direct the reflected or scattered light from different targets 
or target sections. 

In one embodiment, the primary offset F is optimized to provide larger or maximum 
sensitivity to overlay errors. For instance, an offset F equal to X A of the pitch of the target 
provides high overlay sensitivity since it is half-way in-between the two symmetry points 
where overlay error sensitivity is minimum. The secondary offset fO may be chosen such that 
the fO is outside the region of interest for overlay measurements, such as equal to or beyond 
the specification limits, but it should not cause the uncertainty of the overlay measurement to 
allow the error that an out-of-spec measurement can be interpreted as within specifications. 
Nevertheless, this is not a limitation on the range of fO. A large fO may decrease the 
accuracy of the overlay measurements for overlay errors E between -fO and +f0. For 
overlay errors E larger than |f0|, the accuracy of the overlay measurement may be reduced 
due to extrapolation beyond the region -TO to +f0 and the accuracy of the linear 
approximation may also be reduced. 
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Overlay measurements are most commonly done at or near the four corners of the 
stepper field (sometimes with an additional measurement near the center of the field) in 5 to 
25 fields per wafer in semiconductor manufacturing processes. For a system of four targets 
used to determine overlay in the x direction and four targets used to determine overlay in the 
y direction, according to an embodiment of the present invention, a total of 8*4*5=160 
measurements of scatterometry overlay targets may be used to determine the two 
dimensional overlay for a common overlay measurement sampling plan. More 
measurements may be conducted for more detailed sampling plans. 

According to another embodiment of the invention, a total of six targets (three for x 
and three for y, for example) can be used to determine two dimensional overlay for the 
sample. This may facilitate further simplification of the overlay metrology process, 
reduction in processing resources, and decrease of the time used in the metrology process. 
In yet other implementations, additional targets or additional pairs of targets may be 
produced on the sample and used in a substantially similar manner with that described herein 
for determination of overlay based on scatterometry, but adjusted for the increased number 
of targets and corresponding number of diffracted radiation components. The mathematical 
methods for determination of the overlay error E can be similarly adjusted to exploit the 
availability of increased information provided by such additional targets or additional pairs, 
including by possibly accounting for higher order approximation terms in the formula for the 
overlay error E. 

Scatterometric Overlay Determination With Limited Refocusing 

To improve the accuracy of scatterometry overlay determination, more than one 
measurement is preferably carried out. One implementation utilizes a plurality of 
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scatterometry overlay targets, and for each target, the system makes one scatterometric 
measurement of overlay. Another implementation utilizes a single scatterometric target, or a 
single scatterometric target area that comprises multiple target sub-regions, and more than 
one scatterometric overlay measurement is performed for that target or target area. In yet 
another embodiment, a plurality of targets or target regions are used, and more than one 
measurement is performed for some or all of the targets or target regions. 

Conventionally, the optical system is refocused for each individual measurement. 
This, however, can consume a lot of time thus decreasing the processing speed of the 
system. For example, each focus sequence may take between 0. 1 and 1 seconds, and each 
wafer may include between 30 to 70 sites with each site consisting of 8 targets. Using these 
numbers, refocusing may take up to as much as 560 seconds for each wafer. Considering 
there are typically 100s and 1000s of wafers to be inspected this number may be further 
increased to a completely unacceptable level. 

In accordance with one embodiment of the present invention, therefore, multiple 
scatterometry overlay measurements are performed with limited optical refocusing in order 
to increase the processing speed and throughput of the system. By limited optical 
refocusing, it is generally meant that at least some new measurements are performed without 
refocusing the optical system, i.e., multiple measurements are made with the same focus 
setting. For instance, the optical system may be initialized with a focus setting that is 
optimized for a plurality of scatterometric measurements that will be performed, and no 
further refocusing takes place during these individual scatterometric measurements. The 
optimized focus setting may be found once for the entire wafer, or it may be found 
periodically. When periodic, the focus setting may be established at preset increments of 
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time during inspection (e.g., every 30 seconds), for a particular location on the wafer (e.g., 
every 2 x2 cm2 of wafer), for a particular characteristic of the target (e.g., similar line widths 
and spacing) and the like. 

In one embodiment, the wafer includes a plurality of focus zones. Each of the focus 
zones is initialized with a focus setting that is optimized for all of scatterometric 
measurements that will be performed within the focus zone. Refocusing does not occur 
between individual scatterometric measurements inside the focus zone. As such, each target 
within the focus zone is measured with the same optimized focus setting. Any number of 
focus zones may be used. 

The configuration of the focus zones may be widely varied. In one implementation, 
the focus zones correspond to a portion of the wafer. By way of example, the wafer may be 
broken up into plurality of radial focus zones emanating at the center of the wafer and 
working outwards, or into a plurality of angular focus zones, which separate the wafer into 
multiple quadrants. In another implementation, the focus zones correspond to a particular 
set of targets as for example, the targets at the corners of each semiconductor device. In 
another implementation, the focus zone corresponds to a particular target area that includes a 
plurality of targets (see for example 9A). In another implementation, the focus zones 
correspond to a particular target sub-region within the target areas (as for example the x or y 
directed group of targets shown in Fig. 9B). In yet another implementation, the focus zone 
corresponds to a particular sub region within the target itself. 

A method of determining overlay will now be described. The method generally 
includes optimizing the focus setting of a first zone. The method also includes performing a 
first set of measurements on a plurality of targets within the first zone. Each of the targets 
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within the first zone is measured using the optimized focus setting of the first zone. That is, 
a first target is measured, and thereafter a second target is measured without refocusing the 
optical system. Any number of targets can be measured in this manner. The method further 
includes optimizing the focus setting of a second zone. The method additionally includes 
performing a second set of measurements on a plurality of targets within the second zone. 
Each of the targets within the second zone is measured using the optimized focus setting of 
the second zone. That is, a first target is measured, and thereafter a second target is 
measured without refocusing the optical system. Any number of targets can be measured in 
this manner. 

In one example of this method, the first and second zones may represent different 
target areas that include a plurality of targets (See Fig. 9A). In this example, each of the 
targets are located in close proximity to one another and therefore it can be assumed that 
variations in focus from one target to the next are minimal. The method generally includes 
optimizing the focus setting in the target area, and thereafter measuring each of the targets in 
the target area with the optimized focus setting. For example, the first target is measured, 
thereafter the adjacent target is measured, and so on without ever refocusing the optical 
system. When a first target area is measured, the system may repeat these steps on a second 
target area, as for example, a target area located at a different corner of the device. 

In another example of this method, the first and second zones may represent sub 
regions with a target area that includes a plurality of targets. The sub regions may for 
example represent different target orientations (See Fig. 9B). The method generally includes 
optimizing the focus setting in the first sub regions (e.g., targets along the x axis), and 
thereafter measuring each of the targets in the sub region with the optimized focus setting. 
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For example, the first target is measured, thereafter the adjacent target is measured, and so 
on without ever refocusing the optical system. When the first sub region is measured, the 
method continues by optimizing the focus setting in the second sub regions (e.g., targets 
along the y axis), and thereafter measuring each of the targets in the sub region with the 
optimized focus setting. For example, the first target is measured, thereafter the adjacent 
target is measured, and so on without ever refocusing the optical system. In another example 
the system is refocused prior to the measurement on the first scatterometry overlay target in 
an xy scatterometry overlay target group. After the scatterometry signal is measured for the 
first target in the xy overlay target group, the rest of the targets may be measured without 
refocusing. For example, an xy overlay target group comprises four scatterometry overlay 
targets for an overlay error determination in the x direction and four scatterometry overlay 
targets for an overlay error determination in the y direction. 

Scatterometric Overlay Determination Using a Line Image 

A system for scatterometric measurement of overlay using a line image may also be 
implemented. 

In one embodiment, a scatterometric target is illuminated along a single incidence 
line. The scattered radiation is intercepted by a dispersive system, such as a prism or 
diffraction grating. The scattered radiation is thereby dispersed as a function of wavelength. 
The dispersed radiation is then captured by a detector, such as a CCD camera. If the camera 
is properly aligned, the radiation entering the field of view has a two dimensional profile 
with points along the incidence line distributed along the X-axis of the field of view, and 
various wavelengths dispersed along the Y-axis. An example incidence line and field of 
view are illustrated in Fig. 10. 
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The image captured by the camera can then be processed at pixel level to determine 
overlay, possibly using the FT approach disclosed herein. Once overlay is measured along a 
particular incidence line, the wafer could be rotated by 90 degrees (or by any arbitrary angle) 
to measure overlay in a different direction. An advantage of the present invention is that 
overlay may be measured in more than one direction using a single optical system. 

An alternative to illuminating a single incident line is illuminating a larger area but 
only capturing scattered radiation along a detection line. The description provided above 
also applies to this embodiment, with appropriate modifications. 

Algorithms 

Various algorithms and methods may be employed for more efficiently and 
accurately measuring overlay based on scatterometry. 

Old methods of performing such calculations use a model- based method, or a 
differential method for calculating overlay. These conventional methods lack the accuracy 
that may be achieved by combining multiple algorithms for refining and cross checking 
results. Also, these methods do not make good use of pre-existing information (like CD or 
profile data). 

In one general algorithm implementation, overlay is determined using data from a 
plurality of separate calculations of different product parameters. 

In a first embodiment, a first calculation of overlay is performed according to a first 
technique (such as the differential method). A second calculation of overlay is then 
performed according to a second technique (such as a model-based regression). The results 
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are then combined from the two calculations. The results may be combined in various ways. 
For example, one calculation may be used to cross check another. Or one calculation may be 
used to provide initial values to speed up the other calculation. Other combinations may also 
be used. 

In a second embodiment, the speed and/or accuracy of an overlay measurement are 
enhanced by making use of other measured data. For example, film thickness data from the 
layers making up the target may be fed into the algorithm. Such film thickness data could be 
measured using an appropriate tool, such as an ellipsometer or reflectometer. Alternatively 
(or additionally), CD data could be provided from an SCD measurement (scatterometry 
critical dimension or scatterometry profile measurement) and used to speed up or improve 
the accuracy of the scatterometry calculations. Other data from a scatterometry profile 
measurement could be similarly used, such as height or three dimensional profile 
information. Other sources of CD data, like a CD SEM, could be used. 

Combined Scatterometry and Imaging Targets 

In an alternative implementation, the targets are designed for an imaging based 
overlay metrology application, as well as for the above described scatterometry analysis. In 
other words, the scatterometry and imaging target structures are tightly integrated so that 
scatterometry may be performed in conjunction with an image based overlay measurement. 
Preferably, the scatterometry target pairs are symmetrically positioned about the center of 
the field of view. If symmetry is preserved in the illumination and collection channels of the 
imaging system, tool induced shift and will be minimized. By example, Xa and Xa' are twin 
(have similar magnitude but opposite sign offset) targets in the x direction. (Here Xa and Xa' 
may correspond to the targets Xa and Xd in Figure 1). Likewise, Xb and Xb' are opposites. 
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(Here Xb and Xb' may correspond to the targets Xb and Xc in Figure 1). In the y direction, 
targets Ya and Ya' are opposites, while Yb and Yb' are opposites. 

Figure 11a is a top view representation of a first combination imaging and 
scatterometry target embodiment. In this example, the target arrangement includes a set of 
5 four x direction targets for determining overlay using scatterometry and a set of four y 
direction targets for determining overlay using scatterometry. The targets are laid out so that 
adjacent targets (with respect to the overlay measurement direction) have an opposite offset. 
In the illustrated example, target Xa has an opposite offset than target Xa\ and target Xb has 
an opposite offset than target Xb'. Likewise, targets Ya and Ya' have opposite offsets, and 
10 targets Yb and Yb' have opposite offsets. In this example, the targets also include structures 
which can be used for imaged based overlay determination. 

In the illustrated example, the target arrangement includes a black border structure 
1 104 on a first layer and a gray cross-shaped structure 1 102 on a second layer. Using image 
analysis methods, the center of the black structure 1104 may then be compared with the 
15 center of the gray structure 1 102 to determine an overlay error (if any). 

Although this set of targets have an overall rectangular shape which extends longer in 
the x direction than the y direction, of course, the targets could have other shapes (e.g., 
square or any symmetrical polygon) and/or extend longer in a direction other than x. 

In other combinational target arrangements, the imaging structures are laid out in the 
20 center of a symmetrically arranged set of scatterometry targets. Figure lib is a top view 
representation of a second combination imaging and scatterometry target embodiment. As 
shown, scatterometry targets are symmetrically arranged around a central image type target 
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152. In this example, the image type target 1 152 is formed from quadrants of line segments, 
where each quadrant is either in the x or y direction. Suitable image type targets and 
techniques for determining overlay with same are described in the following U.S. patents 
and applications: (1) Patent No. 6,462,818, issued 8 October 2002, entitled "OVERLAY 
5 ALIGNMENT MARK DESIGN", by Bareket, (2) Patent No. 6,023,338, issued 8 February 
2000, entitled "OVERLAY ALIGNMENT MEASUREMENT OF WAFER", by Bareket, (3) 
Application No. 09/894,987, filed 27 June 2001, entitled "OVERLAY MARKS, METHODS 
OF OVERLAY MARK DESIGN AND METHODS OF OVERLAY MEASUREMENTS", 
by Ghinovker et al., and (4) Patent No. 6,486,954, issued 26 November 2002, entitled 
10 "OVERLAY ALIGNMENT MEASUREMENT MARK" by Levy et al. These patents and 
applications are all incorporated herein by reference in their entirety. 

Figure 11c is a top view representation of a third combination imaging and 
scatterometry target embodiment. This target arrangement has scatterometry target 
symmetrically arranged around a box-in-box type target 1 154. A box-in-box target generally 
15 includes a first inner box formed from a first layer surrounded by a second outer box 
structure formed in a second layer. The centers of the inner box structures may be compared 
to the center of the outer box structures to determine overlay error (if present). 

The above targets may be imaged in any suitable manner (e.g., as described in the 
above referenced patents and applications by Bareket, Ghinovker et al., and Levy et al.) to 
20 determine overlay. The target arrangements may also simultaneously or sequentially 
measured with any desirable optical tool as described herein to determine overlay using 
scatterometry techniques. In an alternative embodiment, the scatterometry targets may be 
simultaneously imaged along with the imaging type target structures. The resulting image 
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may be subdivided into the separate scatterometry targets and then the scatterometry 
techniques applied to the image signals for each target (e.g., intensity). The image may be 
obtained at the same time as, or before or after the scatterometry overlay measurements. The 
imaging system may be a high-resolution microscope such as the system in the KLA-Tencor 
5300 or Archer overlay measurement systems available from KLA-Tencor of San Jose, CA. 
Alternatively, the imaging system may be a lower resolution imaging system used for other 
purposes that may include wafer alignment or pattern recognition. 

Mask Alignment During Imprint Lithography 

Because the mask and sample are typically in close proximity (separated by the fluid 
to be polymerized) during nano-imprint lithography, the patterned surface of the mask, the 
fluid, and the patterned sample to be aligned to can be considered to be functionally 
equivalent to a scatterometry overlay target. All of the methods, techniques and targets 
defined for scatterometry overly would then be applicable to alignment procedures. 

In one embodiment, the measurement instrument would project radiation (preferably 
light) through the mask and onto an area of the mask and wafer which contains one or more 
scatterometry overlay targets. The change in properties of the reflected light due to 
scattering or diffraction may then be used to determine the offset between the pattern on the 
mask and the pattern on the wafer. The wafer is then moved relative to the mask (or vice 
versa) to achieve the desired offset. A more accurate alignment may the be achieved, rather 
than with conventional alignment techniques such as direct imaging or moire techniques. 
The instrument could be a reflectometer, polarized reflectometer, spectrometer, imaging 
reflectometer, imaging interferometer, or other such instrument as described herein or in the 
above referenced provisional applications. 
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Disposition of Scatterometrv Overlay Targets 

The accuracy of scatterometry overlay systems can be improved by taking 
measurements at multiple targets located across the surface of interest. In one 
implementation, the scatterometry overlay system may utilize a plurality of scatterometry 
5 targets at various locations across the surface of interest and for each target the system may 
make one scatterometric measurement of overlay. In another implementation, the 
scatterometry overlay system may utilize a plurality of scatterometry target areas at various 
locations across the surface of interest. The scatterometry target areas comprise multiple 
targets, each of which can be measured by the scatterometry overlay system. By way of 
10 example, the scatterometry targets or scatterometric target areas may be located at the 
comers of one or more devices being formed on a wafer. In addition, the scatterometry 
targets may generally include a grating structure, which is measurable by the scatterometry 
overlay system. 

The number of targets generally depends on the available space on the surface of 
15 interest. In most cases, the targets are placed in the scribe line between devices on a wafer. 
The scribe line is the place on the wafer where the wafer is separated into dies via sawing or 
dicing and thus the circuit itself is not patterned there. In cases such as this, the number of 
targets may be limited, at least in part, by the narrowness of the scribeline. As should be 
appreciated, the scribe lines tend to be narrow so as to maximize the amount of devices on 
20 the wafer. 

In accordance with one embodiment of the present invention, the targets are 
strategically placed on the surface of interest in order to overcome any space constraints 
while increasing the number of targets. In one implementation, at least two targets are 
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placed substantially collinearly in a first direction. For example, they may be placed 
collinearly in the x-direction or the y-direction. This arrangement may be useful when 
confronted with narrow spaces as for example scribe lines. In another implementation, 
multiple targets are disposed collinearly in multiple directions. For example, multiple 
targets may be disposed collinearly in both the x direction and the y-direction. This 
arrangement may be useful at the corner of a device as for example at the intersection of two 
scribe lines. 

Although the examples given are directed at a Cartesian coordinate system as defined 
on the surface of interest, it should be noted that the coordinate system may be oriented 
arbitrarily on the surface or interest with the x and y axis being rotated or possibly 
interchanged. Alternatively or in combination with the Cartesian coordinate system, any 
other coordinate system may be used such as for example, a polar coordinate system. 

Fig. 9A is a top view diagram of a scatterometric target area 900 having one 
or more targets 902, in accordance with one embodiment of the present invention. The 
scatterometric targets 902 are generally provided to determine the relative shift between two 
or more successive layers of a substrate or between two or more separately generated 
patterns on a single layer of a substrate. By way of example, the scatterometric targets may 
be used to determine how accurately a first layer aligns with respect to a second layer 
disposed above or below it or how accurately a first pattern aligns relative to a preceding or 
succeeding second pattern disposed on the same layer. 

As shown in Fig. 9A, the scatterometric target area 900 includes at least two 
substantially collinear targets 902. By collinear, it is generally meant that the centers of 
symmetry for each of the targets 902 lie on the same axis 904. By way of example, the axis 
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904 may be aligned with a conventional coordinate system (Cartesian, polar, etc.) or some 
variation thereof. By placing the targets 902 collinearly, the scatterometric target area 900 
does not take up as much width W and therefore may be placed in constrained places as for 
example in the scribeline of the wafer. 

The targets 902 are generally juxtaposed relative to one another along the axis 904. 
In most cases, the juxtaposed targets 902 are spatially separated from one another so that 
they do not overlap portions of an adjacent target 902. Each of the targets 902 is therefore 
distinct, i.e., represents a different area on the substrate. This is typically done to ensure that 
each of the targets 902 is properly measured. The space 906 between targets 902 produces 
distortions in the optical signal and therefore it is excluded from the overlay calculation. 
The size of the space 906 is typically balanced with the size of the targets 902 so as to 
provide as much information as possible for the measurement of overlay. That is, it is 
generally desired to have larger targets 902 and smaller spaces 906 there between. The 
space 906 between targets 902 may be referred to as an exclusion zone. 

The targets 902 may be widely varied, and may generally correspond to any of those 
overlay targets that can be measured via scatterometry. By way of example, the targets 902 
may generally include one or more grating structures 908 having parallel segmented lines 
910. Although not a requirement, the segmented lines 910 for the collinear targets 902 are 
generally positioned in the same direction, which may be parallel or transverse to the axis 
904. In most cases, some of the segmented lines 910 are perpendicular to the axis 904 and 
some are parallel to the axis 904 to enable overlay measurements in x and y. Furthermore, 
the targets 902 may have an identical configuration or they may have a different 
configuration. Configuration may for example include the overall shape and size of the 
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target 902 or perhaps the line width and spacing of the segmented lines 910 associated with 
the grating structure 908 contained within the target 902. Preferably the targets used for the 
overlay measurement in a particular direction, for example x direction, are designed to have 
the same configuration except for the programmed or designed overlay offsets. 

The number of targets may also be widely varied. As should be appreciated, 
increasing the number of targets, increases the number of data collection points and therefore 
the accuracy of the measurement. The number of targets 902 generally depends on the 
overall size of the targets 902 and the space constraints in the direction of the axis 904. In 
the illustrated embodiment, eight side by side targets 902 are positioned within the 
scatterometric target area 900. A scatterometric target area may be equivalent to an xy 
scatterometry overlay target group as discussed above. 

Using the above mentioned targets 902, scatterometric overlay measurements may be 
made sequentially, one target at a time, to measure overlay while eliminating effects due to 
variations in other sample parameters, such as film thickness. This can be accomplished via 
continuously scanning of the scatterometric target area (including for example the targets 
and the spaces there between) or by stepping to each of the targets. Alternatively, 
measurements may take place substantially simultaneously using two or more scatterometry 
signal beams for two, more than two, or all targets to increase throughput. The multiple 
scatterometry signal beams may come from more than one substantially independent 
scatterometry optical systems, or they may share much of the optical system, for example 
they may share the same light source, the same beam directing optics, or the same detector 
system. 
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Although the method described above includes placing the centers of symmetry for 
each of the targets substantially collinear, it should be noted that the centers of symmetry 
may be offset from the axis so long as a measurable portion of the targets still falls on the 
same axis. 

Furthermore, although the method described above includes placing targets of similar 
orientation along the same axis, it should be noted that some of the targets may be positioned 
with a different orientation. For example, a first group of the targets 902 may have 
segmented lines positioned in the x dimension while a second group of the targets 902 may 
have segmented lines position in the y dimension. 

Moreover, although the targets 902 are only shown positioned along a single axis 
904, it should be noted that the targets may be positioned on multiple axis. For example, as 
shown in Fig. 9B, a first group of targets 902A may be disposed collinearly along a first axis 
904A and a second group of targets 902B may be disposed collinearly along a second axis 
904B. This implementation permits independent measurement of overlay in at least two 
directions. The first and second axis are typically transverse to one another and more 
particularly perpendicular one another. In the illustrated embodiment, the first axis 904A 
corresponds to the X-dimension while the second axis 904B corresponds the Y-dimension. 
Furthermore, each group consists of four targets 902. This implementation permits 
independent measurement of overlay in the X and Y directions. 

Further still, although the targets have been described as having features (e.g., 
segmented lines) in substantially one direction, it should be noted that the targets may 
include features in more than one direction. In one implementation, for example, one or 
more of the collinearly positioned targets include features that permit scatterometric overlay 
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measurement in first and second directions. By way of example, the features such as the 
segmented lines may be positioned in both the X and Y dimensions. In this case, the need for 
disposing targets along more than one axis as shown in Fig. 9B is reduced or eliminated. 
That is, if each target has features that permit two-dimensional scatterometric measurements, 
overlay may be determined along both the X- and Y-axes using a single set of targets 
disposed substantially-collinearly along a single axis. Alternatively, one or more targets 
may include one or more sub-targets. If the sub-targets have features that permit two- 
dimensional scatterometric measurements, the number of targets desirable for a particular 
degree of measurement accuracy may be reduced, and the targets may be disposed along a 
single line. 

Additionally, targets disposed along one or more axis may be used for measurement 
of more then one parameter. For example, a first set of targets may be used for 
scatterometric measurement of wavelength along the X-axis and a second set of targets may 
be used for scatterometric measurement of spatial resolution along the Y-axis. In an 
alternative implementation, scatterometric measurement of spatial resolution may be 
performed along the X-axis while spectral measurements are performed along the Y-axis. 

Combined CD and Overlay Marks 

Scatterometry measurement targets consume a significant area of the wafer for both 
metrology of CD and overlay. This wafer area becomes very valuable as design rule shrinks. 
Currently, scatterometry overlay marks may consume >35x70um space for each xy 
scatterometry overlay target group or mark on the wafer. These are used only for overlay 
measurements and therefore the manufacturers consider the loss of wafer space as 
undesirable. Therefore, it is desirable to reduce the total wafer area required for 
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measurement targets or measurement features. Changes to optical system design to enable 
measurements on smaller targets may result in greater complexity of the optical system and 
potentially compromise measurement performance. In scatterometry overlay measurement as 
describe herein, the target area typically consists of four gratings for each axis (X and Y). 
Each of these gratings is typically larger than 15x1 Sum with a limited opportunity to shrink 
it further using conventional techniques. Each grating is composed of a first layer grating 
(e.g. STI) and a top layer grating (e.g. gate.). Each of the two layers has a programmed 
offset, which is typically smaller than the pitch of the top grating. In many cases the top 
layer is photoresist. An overlay measurement is achieved by analyzing the spectra of a 
reflected light from each of these gratings. 

In scatterometry critical dimension (CD) measurement, the target area typically 
consists of a single grating, which may be positioned along either axis (X or Y). In some 
cases, the target area may include multiple gratings for each axis (X and Y). Each of these 
gratings is typically about 50x50um. The measurement is typically performed on a single 
process layer target with no pattern underneath. This measurement is typically done on a 
photoresist pattern following a resist development step in a lithographic patterning process 
or following an etch or CMP process in other modules of the fabrication. A CD 
measurement is achieved by analyzing the spectra of a reflected light from the grating(s) as 
described in the above referenced U.S. Patent No. 6,590,656 by Xu, et al. 

In accordance with one embodiment of the present invention, the scatterometry CD 
marks and the scatterometry overlay marks are combined to enable the fab to save wafer 
space and to print larger scatterometry overlay marks with no impact on the wafer scribe- 
line. The combined mark is constructed with a scatterometry CD target as the first layer and 
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scatterometry overlay target patterns as the top layer. This results in zero or minimal 
additional scribe line space allocated to scatterometry overlay. 

Fig. 12 is a diagram of a combined mark 1200, in accordance with one embodiment 
of the present invention. The combined mark 1200 provides for both scatterometry CD 
5 measurement and scatterometry overlay measurement at different steps in the wafer 
manufacturing process. The combined mark 1200 is formed on at least two layers of the 
wafer, particularly a first layer LI and a top layer L2. The first layer LI includes 
scatterometry CD/profile targets 1202 and the top layer L2 includes scatterometry overlay 
targets 1204. Although shown as separate layers in the diagram, it should be noted that the 

10 scatterometry overlay targets 1204 are built on (over) the scatterometry CD profile targets 
1202. The scatterometry CD/profile targets 1202 form an LI scatterometry CD mark, which 
can be measured to determine CD after formation or processing of the LI pattern. The 
scatterometry overlay targets 1204 cooperate with the scatterometry CD/profile targets 1202 
to form an L2-L1 scatterometry overlay mark, which can be measured to determine overlay 

15 between the layers after formation of the L2 pattern (which comes after LI pattern 
formation). As should be readily apparent, this method may be repeated to produce a layer 2 
L2 scatterometry CD/profile target(s) followed by a Layer 3 L3 pattern to create an L3-L2 
scatterometry overlay mark or target area. 

The configuration of the scatterometry CD/profile targets 1202 and scatterometry 
20 overlay targets 1204 may be widely varied. In the illustrated embodiment, the scatterometry 
CD/profile targets 1202 disposed on LI include a first grating 1206 oriented in a first 
direction and a second grating 1208 oriented in a second direction. The first direction may 
be orthogonal to the second direction. By way of example, the first grating 1206 may 
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include vertical lines while the second grating 1208 may include horizontal lines. In 
addition, the scatterometry overlay targets 1204 disposed on L2 include a first group of 
gratings 1210 and a second group of gratings 1212. Both the first and second groups of 
gratings 1210, 1212 include one or more gratings 1214. The number of gratings 1214 may 
be widely varied. In one implementation, both the first and second groups 1210 and 1212 
include four gratings 1214. The gratings 1214A in the first group 1210 are oriented in the 
first direction, and the gratings 1214B in the second group 1212 are oriented in the second 
direction. By way of example, the gratings 1214A in the first group 1210 may include 
vertical lines while the gratings 1 2 14B in the second group 1212 may include horizontal 
lines. 

In order to produce an L2-L1 overlay mark, the first group of gratings 1210 is 
positioned over the first grating 1206 of the CD/profile targets 1202, and the second group of 
gratings 1212 is positioned over the second grating 1208 of the CD/Profile targets 1202. 
This places gratings with similarly directed lines together, i.e., vertical lines with vertical 
lines and horizontal lines with horizontal lines. The first group of gratings 1210 cooperates 
with the first grating 1206 of the CD/profile targets 1202 and the second group of gratings 
1212 cooperates with the second grating 1208 of the CD/profile targets 1202. The alignment 
between layers is determined by the shift produced between the corresponding lines of these 
cooperating structures. The vertical lines, for example, may be used to determine X overlay 
and the horizontal lines, for example, may be used to determine Y overlay. 

Although the first and second gratings 1206 and 1208 of the CD mark are shown 
together, it should be noted that they may be placed apart. When implemented apart, the 
first group of gratings 1210 and the second group of gratings 1212 would also be placed 
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apart, i.e., the first group of gratings 1210 goes with the first grating 1206 and the second 
group of gratings 1212 goes with the second grating 1208. 

The advantages of combining overlay and CD marks are numerous. Different 
embodiments or implementations may have one or more of the following advantages. One 
5 advantage of combining marks is in the ability to reduce the need for additional wafer space 
for scatterometry overlay targets. Another advantage is that larger scatterometry overlay 
targets may be allowed if they do not require as much additional scribe line space. Larger 
scatterometry overlay targets may make optical design or optics manufacturing easier and 
may provide better scatterometry overlay metrology performance than on smaller 
10 scatterometry overlay targets. 

Combination of Scatterometry Overlay and CDSEM 

The purpose of this embodiment is to enable measurement of critical dimensions on a 
semiconductor wafer with an electronic microscope (CD-SEM) and measurement of overlay 
using scatterometry on the same measurement system or using linked measurement systems 

15 sharing at least part of a robotic wafer handling system. The established methods of 
measuring critical dimensions and overlay commonly require scheduling and operating 
separate measurement systems. One disadvantage of the established methods of measuring 
critical dimensions and overlay on separate measurement systems is the additional time 
required to schedule and run separate operations on separate metrology tools. Another 

20 disadvantage is the redundancy of common parts and the costs associated therewith. 

In order to overcome these disadvantages, a metrology system that combines 
Scatterometry Overlay and CDSEM may be provided. In one embodiment, a scatterometry 
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overlay measurement (SCOL) system is integrated with a CDSEM system such that the 
CDSEM and SCOL systems share at least part of the robotic wafer handling system and/or 
data systems. Alternatively, the CDSEM and the scatterometry overlay systems may be 
separate systems capable of independent operation, but linked in such a way that they share 
at least part of a robotic wafer handling system. 

In operation, a wafer, a group of wafers, or batch of multiple wafers may be 
introduced to the combined metrology system by loading the wafer container onto the 
robotic wafer handling system dedicated to this combined metrology system. Measurement 
recipes may be selected specifying CDSEM measurements on some or all of the wafers and 
scatterometry overlay measurements on some or all of the wafers. The CDSEM 
measurements and the SCOL measurements may be specified together in one or more 
recipes, or may be specified in separate recipes. The CDSEM and SCOL measurements may 
be done on the same wafers or on different wafers or on some of the same wafer and some 
different wafers. The CDSEM and SCOL systems may operate in parallel, or in series. 

One example of the combined metrology system would be integration of a 
scatterometry system capable of scatterometry overlay measurements (such as a 
spectroscopic ellipsometer, spectroscopic polarized reflectometer, or +/-1 order diffraction 
scatterometer) inside a CD-SEM such as any of those manufactured by KLA-Tencor of San 
Jose, CA. Another example of a combined metrology system would be a linked system 
comprising a scatterometry overlay system, a CDSEM such as any of those manufactured by 
KLA-Tencor of San Jose, CA, a robotic handler, and a wafer scheduling system. 
Communication to factory automation and/or factory information, and/or factory process 
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control systems may be through separate communication or automation systems or may be at 
least partially or completed shared. 

One advantage of the combined CDSEM and SCOL metrology system is the 
reduction in overall time required to complete scheduling and/or performing the CDSEM 
and scatterometry overlay measurements. At least one queue delay time may be eliminated. 
Performing CDSEM and overlay measurements in parallel can save at least part of the time 
required for separate measurement operations. 

Figs. 13A - 13D show variations of a combined metrology tool 1300, in accordance 
with several embodiment of the present invention. In all the figures, the combined 
metrology tool 1300 includes a robotic wafer handling system 1302, a critical dimensioning 
scanning electron microscope (CD-SEM) 1304, a scatterometry overlay (SCOL) 
measurement instrument 1306, a wafer load position A 1308 and a wafer load position B and 
1310, respectively. The robotic wafer handling system 1302 is configured to transfer wafers 
to and from the CD-SEM 1304 and SCOL measurement instrument 1306 as well as to and 
from the wafer load positions A and B 1308 and 1310. The critical dimensioning scanning 
microscope 1304 is configured to measure the critical dimensions that may include, for 
example, linewidth, top linewidth, via diameter, sidewall angle and profile. The 
scatterometry overlay measurement instrument 1306 is configured to measure the overlay as 
for example between two layers disposed on the wafer. The wafer load position A and wafer 
load position B are configured to hold one or more wafers. In most cases, they hold a 
plurality of wafers. The wafers may be from the same lot of from a different lot. 

In Figs. 13A and D, the CD-SEM 1304 and the SCOL measurement instrument 1306 
are separate systems that are integrated via the robotic wafer handling system 1302. In Fig. 
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13B, the SCOL measurement instrument 1306 is integrated into the CDSEM 1304. In Fig. 
13C, the SCOL measurement instrument 1306 is integrated into the robotic wafer handling 
system 1302. 

In one operation, some of the wafers from wafer load position A and/or B have 
critical dimensions measured at the CD-SEM and thereafter have overlay measured at the 
scatterometry overlay measurement instrument. The wafer can be measured by both 
processes without being removed from the system, i.e., the wafer handling as well as the 
throughput issues associated therewith are reduced. In another operation, some wafers from 
wafer load position A and/or B have critical dimensions measured at the CDSEM and some 
other wafers from wafer load position A and/or B have overlay measured on SCOL 
measurement instrument. In any of these operations, the CDSEM and SCOL measurement 
instrument can proceed independently and simultaneously. 

Fig. 14 is a flow diagram 1400 using a combined metrology tool, in accordance with 
one embodiment of the present invention. The method generally includes step 1402 where a 
group of wafers are received by the metrology tool. By way of example, the wafers may be 
a wafer lot that is loaded at position A in Fig. 13. Following step 1402, the process flow 
1400 proceeds to step 1404 where the critical dimensions of a wafer from the group of 
wafers is measured. By way of example, the critical dimension measurements may be 
performed by a CDSEM as for example the CDSEM shown in Fig. 13. The process flow 
1400 also proceeds to step 1406 where the overlay of a wafer from the group of wafers is 
performed by a SCOL measurement instrument as for example the instrument shown in Fig. 
13. Steps 1404 and 1406 may be performed at the same time on different wafers. Steps 
1404 and 1406 may be performed on the same wafer in a sequence of operations, as for 
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example, from CD to overlay or from overlay to CD. The transferring of the wafer may for 
example be performed by the robotic system shown in Fig. 13. When all the measurements 
are performed, the process flow proceeds to step 1408 where the group of wafers are 
released from the metrology tool. 

Combination of Scatterometrv Overlay and Other Metrology or Inspection Methods 

Scatterometry overlay may be combined with scatterometry profile or scatterometry 
critical dimension systems, or other semiconductor metrology or inspections systems. 
Scatterometry overlay may be integrated with a semiconductor process tool, for example a 
lithography resist process tool (also known as a resist track). Integration of metrology 
systems with process systems and combinations of metrology systems are described in (1) 
U.S. patent application, having Patent No. 09/849,622, filed 04 May 2001, entitled 
"METHOD AND SYSTEMS FOR LITHOGRAPHY PROCESS CONTROL", by 
Lakkapragada, Suresh, et al. and (2) U.S. patent, having Patent No. 6,633,831, issued 14 
October 2003, entitled "METHODS AND SYSTEMS FOR DETERMINING CRITICAL 
DIMENSION AND A THIN FILM CHARACTERISTIC OF A SPECIMAN", by 
Nikoonahad et al, which applications are incorporated herein by reference in their entirety. 

Although the foregoing invention has been described in some detail for purposes of 
clarity of understanding, it will be apparent that certain changes and modifications may be 
practiced within the scope of the appended claims. Therefore, the described embodiments 
should be taken as illustrative and not restrictive, and the invention should not be limited to 
the details given herein but should be defined by the following claims and their full scope of 
equivalents. 
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